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FINAL  REPORT:  DAMD17-02-1-0631 

TITLE:  A  Novel  Strategy  for  Controlling  the  Metastatic  Phenotype:  Targeting  the 

SNAG  Repression  Domain  in  the  SNAIL  Zinc-Finger  Protein 

Statement  of  Work: 

Task  1:  Reconstitute,  map  and  determine  the  specificity  of  the  SNAIL-SNAG 

Domain/SNAP  interaction  in  vitro  and  in  vivo.  (Months  1-18) 

a.  Isolate  full-length  SNAP  and  SNAIL  cDNAs  and  create  a  panel  of  deletion  and 
point  mutations  based  on  domain  boundaries  and  subclone  these  constructs  to 
bacterial  and  mammalian  expression  vectors  (constitutive  and  inducible),  which 
incorporate  epitope  tags.  Purify  bacterially  expressed  antigens,  raise  polyclonal 
antibodies,  and  evaluate  their  utility  and  specificity  in  biochemical  assays  of  purified 
and  cell-derived  proteins.  (Months  1-12) 

Accomplishments: 

This  aim  has  been  completely  accomplished.  We  have  obtained  and/or  cloned 
ourselves  and  subsequently  subcloned  cDNAs  for  SNAIL  and  the  SNAP  protein  AJUBA 
and  assembled  a  vast  array  of  bacterial  and  mammalian  expression  vectors  using  both 
epitope  tagged  and  non-epitope  tagged  vectors.  Each  of  these  constructs  has  been 
subject  to  extensive  truncation  and  point  mutagenesis  in  order  to  fine  map  interaction 
domains.  Antibodies  to  AJUBA  have  been  raised  using  purified  bacterial  protein  using 
6-histidine  fusion  technology.  These  antibodies  have  been  affinity  purified  and  are 
highly  active  in  almost  every  assay  we  have  tried  including  IHC,  IF, 
Immunoprecipitation,  and  Chromatin  Immunoprecipitation  (CHIP)  using  protein 
sources  from  both  cells  and  recombinant  sources.  These  reagents  have  been  distributed 
freely  to  the  research  community.  We  have  had  a  bit  of  trouble  getting  these  antibodies 
to  work  on  paraffin  embedded  tissue  specimens,  however  this  is  not  uncommon  and  we 
continue  to  optimize  this  procedure.  We  have  determined  that  they  do  not  cross-react 
with  other  LIM  domain  proteins  in  all  procedures  and  have  utilized  them  as  described 
below. 

b.  Perform  in  vitro  interaction  assays  using  a  combination  of  GST  chromatography,  co- 
immunoprecipitation  Perform  in  vivo  interaction  assays  using  transient  co¬ 
transfection  of  tagged  SNAP  and  SNAIL  expression  constructs  and  co- 
immunoprecipitation  of  cell  extracts  (as  above).  Perform  hybrid  in  vivo  /  in  vitro 
interaction  assays  combining  cell  derived  (transfected  or  endogenous)  proteins  with 
purified  protein  baits  in  GST  chromatographic  and  gel  shift/ supershift  assays. 
Evaluate  specificity  of  interactions  using  other  SNAP  family  members.  (Months  6- 
18) 

Accomplishments: 

This  aim  has  been  almost  completely  accomplished.  Using  a  combination  of  the 
antibodies  produced  (described  above)  and  affinity  chromatography  assays,  we  have 
efficiently  reconstituted  the  SNAG  domain- SNAP  interaction  in  vitro  and  in  vivo.  Co¬ 
transfections  of  LEXA  and  GAL4  fusion  to  the  minimal  SNAG  domain  showed  robust 
interaction  between  the  AJUBA  SNAP  and  the  SNAG  domain.  This  binding  was 
dependent  upon  the  most  COOH  terminal  of  the  three  LIM  domains  in  AJUBA  and  did 
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not  occur  with  other  LIM  domain  proteins  of  the  AJUBA  family.  Thus,  there  is 
tremendous  specificity  for  interaction  among  these  very  highly  related  proteins  The 
SNAG  domain  of  SNAIL  is  required  for  AJUBA  binding  but  there  appears  to  be 
multiple  contacts  for  protein-protein  interaction  among  the  complete  300  amino  acid 
NH2  terminus  of  SNAIL.  Importantly  single  amino  acid  substitutions  in  the  SNAG 
domain  strongly  inhibited  both  AJUBA  binding  and  repression  implying  that  this 
interaction  is  critical  in  vivo.  To  confirm  this  we  have  also  shown  that  the  complex  of 
SNAIL  and  AJUBA  can  be  easily  detected  bound  to  the  e-cadherin  target  gene  using 
chromatin  immunoprecipitation  experiments.  Though  these  experiments  go  way 
beyond  the  statement  of  work,  they  are  the  gold  standard  to  show  that  a  repressor-co¬ 
repressor  interaction  (like  SNAIL- AJUBA)  occurs  at  an  endogenous  target  gene.  We 
have  now  shown  this  with  both  engineered  integrated  targets  and  endogenous  targets. 
Recapitulating  the  SNAG-SNAP  interaction  with  completely  purified  proteins  from 
bacteria  has  been  very  difficult  (see  below  also).  Simply  put,  both  the  SNAG  peptide 
(alone)  and  the  isolated  AJUBA  LIM  domain  are  profoundly  insoluble  proteins.  We 
tried  multiple  different  strategies  including  co-expression,  co-folding,  low  temperature 
induction  etc.  We  modeled  these  studies  after  our  highly  successful  expression  and 
reconstitution  of  another  zinc-finger  repression  domain,  the  KRAB  domain. 
Unfortunately,  these  peptides  behave  very  differently.  We  continue  to  optimize  this  in 
order  to  perform  structural  and  small  molecule  inhibitor  studies. 

Statement  of  Work: 

Task  2:  Identify  dominant  negative  and  peptide-based  inhibitors  of  the  SNAG- 

SNAP  interaction  and  use  them  as  tools  to  manipulate  SNAIL-mediated 
repression  and  E-Cadherin  expression  in  vivo.  (Months  8-30) 

a.  Generate  stable  cell  lines  with  integrated  metallothionein-inducible,  epitope  tagged 
SNAP  in  recipient  cells  with  low  E-Cadherin  expression  to  evaluate  whether  E- 
Cadherin  is  reactivated  by  induction  of  SNAP  expression  to  dominantly  interfere 
with  SNAIL  repression.  Clonal  inducible  cell  lines  will  be  assessed  in  for 
motility  /invasiveness  in  tissue  culture  based  assays,  and  for  metastases  in  SCID 
mice  based  models.  (Months  8-24) 

Accomplishments: 

This  aim  has  been  largely  accomplished  albeit  with  much  different  experimental 
strategies  and  systems  than  originally  proposed.  Initially  we  tried  to  produce  inducible 
SNAP  and  SNAIL  cell  lines  using  a  variety  of  inducible  plasmid  or  lentiviral  systems. 
All  of  them  proved  too  leaky  in  expression  and  resulted  in  rapid  mesenchymal 
differentiation  making  it  impossible  to  examine  clones.  Instead  we  turned  to  P19  and  F9 
embryonal  cells  that  express  high  levels  of  AJUBA.  Using  both  stable  and  transient 
siRNA  mediated  knockdown  technology,  we  have  shown  an  inverse  correlation 
between  e-cadherin  expression  and  AJUBA /SNAIL  expression.  Since  these  cell  lines 
could  not  be  evaluated  in  mice,  we  turned  to  a  collaboration  with  Dr  Gregg  Longmore 
of  Wash.  U.  St  Louis,  who  was  developing  the  xenopus  egg  system  for  looking  at  the 
role  of  AJUBA  and  SNAIL  in  early  development.  Together,  our  laboratories  have 
shown  that  morpholino-mediated  knockdown  of  either  SNAIL  or  AJUBA  is  required  for 
early  neural  crest  induction  and  that  the  interaction  of  thee  proteins  is  critical  for  that 
developmental  pathway. 
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a.  Create  SNAG  domain  cell  delivery  peptides  based  on  fusion  of  the  21  amino  acid 
SNAG  domain  to  TAT  and/or  antennapedia  sequences  and  evaluate  in  assays 
described  above  for  the  alteration  of  SNAG  mediated  biological  responses  in  vivo. 
(Months  12-30) 

Accomplishments: 

This  did  not  work  at  all.  We  spent  an  extreme  amount  of  effort  to  produce  soluble  TAT 
SNAG  fusion  peptides  including  nuclear  localization  signals  and  also  including 
determinants  to  increase  solubility  and  uptake  by  cells.  The  alternative  strategy  of 
chemically  attaching  the  SNAG  peptide  to  the  penetraitin  molecule  also  met  with 
complete  failure.  We  included  proper  controls  for  proteins  that  have  been  shown  to  be 
translocated  to  the  nucleus  and  affect  transcription  processes  using  both  these  systems. 
We  conclude  that  the  physical  characteristics  of  the  SNAG  peptide  is  are  not  conducive 
to  this  approach. 

Statement  of  Work: 

Task  3:  Define  the  set  of  genes  that  are  under  control  of  the  SNAIL-SNAP 

Repression,  system  in  breast  cancer  cells  using  cDNA  microarray  analysis 
to  define  the  complete  transcriptome  regulated  by  the  SNAIL  protein. 
(Months  18-36) 

a.  Using  the  inducible  cell  lines  or  the  peptide-based  inhibitor  approach  (above), 
isolate  RNA  populations  from  cells  +/-  induction,  or  +/-  inhibitor  treatment 
(Months  18-24). 

b.  Perform  cDNA  microarray  analysis  of  genes  that  are  differentially  regulated  by 
The  Wistar  Institutes  Microarray  facility  to  define  the  set  of  target  genes  that  are 
repressed  in  SNAIL  expressing  cells,  and  identify  those  that  are  de-repressed  as  a 
result  of  blocking  SNAIL  function.  .  (Months  24-30) 

c.  Examine  target  gene  (from  3b.)  expression  patterns  in  the  inducible  cell  lines  by 
Northern  analysis  and  validate  gene  function  by  antisense  mediated  inhibition 
(Months  30-36 

Accomplishments: 

This  work  is  ongoing  albeit  with  a  completely  different  experimental  strategy.  As 
described  above  in  Task  2a  we  were  unable  to  generate  non-leaky  clones  of  inducible 
SNAIL  or  AJUBA  expressing  cells.  These  clones  were  found  unsuitable  for  Microarray 
analysis.  Task  3  was  completely  dependent  on  this  condition.  However,  we  are  now 
in  the  process  of  asking  this  important  question  in  a  different  way  by  using  our  stable 
siRNA  knockdown  of  AJUBA  and  SNAIL.  We  are  using  individual  clonal  knockdown 
cell  lines  to  profile  genes  that  are  derepressed  using  microarray  technology. 

Additional  discoveries,  partially  supported  by  this  Award  via  experiments  that  were 
not  contained  in  the  original  SOW: 

1.  Discovery  that  SNAG  mediated  e-cadherin  repression  is  not  mediated  by  post- 
translational  modification  by  SUMO,  unlike  many  other. 
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2.  Discovery  that  the  AJUBA  protein  binds  directly  to  the  PRMT5  arginine 
methyltranferase  and  places  repressive  marks  on  nucleosomal  histones  in  SNAIL 
target  genes. 
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Heterochromatin  protein  1  (HP1)  is  a  key  component  of  constitutive  heterochromatin  in  Drosophila  and  is 
required  for  stable  epigenetic  gene  silencing  classically  observed  as  position  effect  variegation.  Less  is  known 
of  the  family  of  mammalian  HP1  proteins,  which  may  be  euchromatic,  targeted  to  expressed  loci  by 
repressor-corepressor  complexes,  and  retained  there  by  Lys  9-methylated  histone  H3  (H3-MeK9).  To 
characterize  the  physical  properties  of  euchromatic  loci  bound  by  HP1,  we  developed  a  strategy  for  regulated 
recruitment  of  HP1  to  an  expressed  transgene  in  mammalian  cells  by  using  a  synthetic,  hormone-regulated 
KRAB  repression  domain.  We  show  that  its  obligate  corepressor,  KAP1,  can  coordinate  all  the  machinery 
required  for  stable  gene  silencing.  In  the  presence  of  hormone,  the  transgene  is  rapidly  silenced,  spatially 
recruited  to  HPl-rich  nuclear  regions,  assumes  a  compact  chromatin  structure,  and  is  physically  associated 
with  KAP1,  HP1,  and  the  H3  Lys  9-specific  methyltransferase,  SETDB1,  over  a  highly  localized  region 
centered  around  the  promoter.  Remarkably,  silencing  established  by  a  short  pulse  of  hormone  is  stably 
maintained  for  >50  population  doublings  in  the  absence  of  hormone  in  clonal-cell  populations,  and  the  silent 
transgenes  in  these  clones  show  promoter  hypermethylation.  Thus,  like  variegation  in  Drosophila, 
recruitment  of  mammalian  HP1  to  a  euchromatic  promoter  can  establish  a  silenced  state  that  is  epigenetically 
heritable. 

[Keywords:  KRAB  domain;  KAP1;  HP1;  heterochromatin;  gene  silencing;  position  effect  variegation] 
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A  recently  emerging  paradigm  for  the  epigenetic  control 
and  propagation  of  gene  expression  states  involves  the 
role  of  chromatin  structure.  Though  historically  viewed 
as  a  passive  packaging  structure  primarily  used  to  as¬ 
semble  the  enormous  amount  of  DNA  into  a  eukaryotic 
nucleus,  the  nucleosome  with  its  complement  of  core 
histones  has  emerged  as  a  key  target  for  regulating  gene 
expression  (Wolffe  and  Hansen  2001).  The  dynamic  regu¬ 
lation  of  chromatin  organization  appears  to  be  accom¬ 
plished  by  macromolecular  protein  complexes  that  con¬ 
tain  enzymatic  activities  that  modify  the  tails  of  the  core 
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histones.  The  constellation  of  these  histone  modifica¬ 
tions,  including  acetylation,  phosphorylation,  ubiquiti- 
nation,  and  methylation,  create  both  synergistic  and  an¬ 
tagonistic  signals  that  correlate  with  the  transcriptional 
activity  of  a  gene  (Wu  and  Grunstein  2000).  This  emerg¬ 
ing  "histone  code"  is  hypothesized  to  create  functionally 
distinct  subdomains  in  chromatin  that  define  active  ver¬ 
sus  transcriptionally  silent  genes  (Jenuwein  and  Allis 
2001).  Histone  modifications  and  the  chromatin-associ¬ 
ated  proteins  that  interpret  these  signals  may  represent 
an  epigenetic  marking  system  responsible  for  setting  and 
maintaining  heritable  programs  of  gene  expression  dur¬ 
ing  development. 

The  role  of  histone  acetylation/deacetylation  [medi¬ 
ated  by  histone  acetyltransferases  (HATs)  and  histone 
deacetylases  (HDACs),  respectively]  in  modulating  gene 
activity  is  now  well  established  (Kuo  and  Allis  1998). 
The  role  of  histone  methylation  in  the  regulation  of 
chromatin  structure  and  gene  transcription  has  been 
greatly  facilitated  by  the  recent  identification  of  histone 
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methyltransferase  (HMTase)  enzymes  (Zhang  and  Rein- 
berg  2001).  The  discovery  that  the  mammalian  homologs 
of  the  Drosophila  melanogaster  heterochromatin  pro¬ 
tein  Su(var)3-9  are  H3-specific  methyltransferases  sig¬ 
nificantly  supported  the  involvement  of  histone  methyl- 
ation  in  gene  regulation  (Rea  et  al.  2000).  Further,  the 
methylation  was  highly  selective  for  Lys  9  with  the 
methyltransferase  function  mapping  to  the  evolution- 
arily  conserved  SET  (SuVar3-9,  Enhancer  of  Zeste,  Tri¬ 
thorax)  domain  (Rea  et  al.  2000).  Because  histone  H3,  Lys 
9  methylation  (MeK9)  is  highly  enriched  in  heterochro¬ 
matin  and  other  transcriptionally  silent  regions  of  the 
nucleus,  it  is  postulated  that  this  modification  might  be 
a  vital  component  of  the  histone  code  that  controls  gene 
silencing. 

The  recent  discovery  that  the  Lys  9-methylated  his¬ 
tone  H3  (H3-MeK9)  mark  establishes  a  high  affinity 
binding  site  for  the  heterochromatin  protein  1  (HP1) 
family  of  heterochromatin  proteins  has  provided  one  of 
the  first  links  between  a  histone  mark  and  establishment 
of  a  repressive  chromatin  environment  for  gene  expres¬ 
sion  (Bannister  et  al.  2001;  Lachner  et  al.  2001;  Na- 
kayama  et  al.  2001).  The  HP1  proteins  are  small  nonhis¬ 
tone  chromosomal  proteins  that  are  composed  of  an 
NH2-terminal  chromodomain  (CD),  a  COOH-terminal 
chromoshadow  domain  (CSD),  and  a  variable  hinge  re¬ 
gion  that  separates  these  two  domains  (Wallrath  1998). 
The  CD  binds  directly,  with  high  affinity  to  the  MeK9 
residue  in  the  histone  H3  tail  (Bannister  et  al.  2001; 
Lachner  et  al.  2001).  The  CSD  is  a  homodimerization 
domain  that  directly  recognizes  a  consensus  pentapep- 
tide  sequence,  PxVxL  with  high  affinity,  which  is  pre¬ 
sent  in  a  growing  number  of  nuclear  proteins  that  may 
target  the  HP1  protein  to  specific  genes  or  subnuclear 
compartments  (Lechner  et  al.  2000;  Smothers  and  Heni- 
koff  2001). 

Clues  to  the  biological  consequences  of  HP  1-chroma¬ 
tin  interaction  have  come  from  the  study  of  D.  melano- 
gaster  HP1  (Wallrath  1998).  HP1  is  intimately  involved 
in  the  phenomenon  of  position  effect  variegation  (PEV) 
in  D.  melanogaster.  Classic  PEV  is  observed  when  a  tran¬ 
scribed  euchromatic  gene  becomes  integrated  adjacent  to 
a  block  of  silent  heterochromatin  (Baker  1968;  Waki- 
moto  1998).  Transcription  is  silenced  in  only  a  subset  of 
cells,  and  this  state  is  stably  inherited  by  their  progeny 
thus  leading  to  variegated  or  mosaic  patterns  of  expres¬ 
sion  in  the  adult  organism.  The  stochastic  nature  of  PEV 
is  hypothesized  to  be  due  to  the  variable  spreading  of 
heterochromatin  (and  thus  transcriptional  silencing) 
into  adjacent  regions  of  the  genome  (Locke  et  al.  1988). 
Genetic  screens  for  modifiers  of  PEV  led  to  the  discovery 
of  the  suppressor  allele  Su(var)2-5  that  encoded  HP1,  a 
discovery  consistent  with  its  earlier  identification  as  a 
heterochromatin-associated  protein  in  polytene  nuclei 
(James  and  Elgin  1986).  Genetic  experiments  revealed 
that  HP1  is  a  strong,  dose-dependent  modifier  of  PEV 
(Locke  et  al.  1988;  Eissenberg  et  al.  1992).  The  HP1  pro¬ 
tein  is  physically  associated  with  both  constitutive  het¬ 
erochromatin  and  adjacent  variegating  transgene,  and 
likely  contributes  directly  to  the  formation  or  stabiliza¬ 


tion  of  silent  heterochromatin  (Eissenberg  et  al.  1992). 
As  would  be  expected,  these  large  blocks  of  HP1 -contain¬ 
ing  heterochromatin  are  highly  enriched  in  H3-MeK9 
(Nakayama  et  al.  2001).  Thus,  the  discovery  of  the  H3- 
MeK9-HPl  connection  establishes  a  direct  link  between 
histone  methylation  and  the  stable  epigenetic  gene  si¬ 
lencing,  which  occurs  in  or  adjacent  to  large  blocks  of 
constitutive  heterochromatin. 

The  above  observations  prompt  two  important  ques¬ 
tions:  (1)  Can  HP1  also  participate  in  the  silencing  of 
constitutively  expressed  euchromatic  genes,  that  is, 
genes  that  are  not  physically  adjacent  to  a  large  block  of 
silent  heterochromatin,  and  if  so,  are  those  silenced 
states  stably  heritable?  (2)  How  are  the  required  pro¬ 
cesses  of  histone  deacetylation,  H3-MeK9  methylation, 
HP1  deposition,  and  chromatin  compaction  coordinated 
in  the  nucleus?  In  our  previous  work,  we  have  ap¬ 
proached  the  latter  question  by  characterizing  the  KAP1 
corepressor.  KAP1  displays  the  hallmarks  of  a  scaffold 
protein  that  can  recruit  and  coordinate  many  of  the  com¬ 
ponents  required  for  HP  1  -mediated  gene  silencing  to  spe¬ 
cific  loci.  KAP1  coordinates  histone  deacetylation  via 
the  recruitment  of  the  NuRD  complex  (Schultz  et  al. 
2001);  histone  H3  Lys  9  methylation  via  the  action  of  a 
novel  KAP1 -associated  histone  methyltransferase 
named  SETDB1  (Schultz  et  al.  2002);  and  direct  binding 
and  deposition  of  HP1  mediated  by  a  highly  conserved 
PxVxL  motif  present  in  KAP1  (Lechner  et  al.  2000)  that 
interacts  with  the  chromoshadow  domain  of  HP1.  More¬ 
over,  KAP1  functions  as  a  corepressor  by  binding  directly 
to  the  highly  conserved  KRAB  repression  domain  that  is 
present  in  more  than  220  human  zinc  finger  proteins, 
suggesting  that  this  mechanism  is  likely  targeted  to  a 
large  number  of  specific  loci  in  vivo  (Peng  et  al.  2000; 
Abrink  et  al.  2001). 

In  the  work  presented  here,  we  have  utilized  the 
KRAB-KAP1  system  to  answer  the  first  question  posed 
above,  that  is,  can  HP1  participate  in  the  silencing  of 
euchromatic  genes?  Toward  this  objective,  we  have  cre¬ 
ated  a  hormone  inducible  system  in  a  mammalian  cell 
line  that  allows  transient  and  reversible  targeting  of  en¬ 
dogenous  KAP1  and  its  associated  activities  to  a  highly 
transcribed  euchromatic  reporter  transgene.  We  con¬ 
clude  that  KAP1  coordinates  the  establishment  of  highly 
localized  heterochromatin-like  silenced  states  at  euchro¬ 
matic  genes  and  that  these  states  are  epigenetically  heri¬ 
table. 


Results 

Hormone  regulatable  chimeric  repressor  proteins 

We  used  a  two-plasmid  system  composed  of  a  regulat¬ 
able  chimeric  repressor  (Fig.  1A)  and  a  synthetic  reporter 
gene  (Fig.  2A).  The  90  amino-acid  KRAB  domain  of  Koxl 
is  sufficient  to  bind  KAP1  and  is  a  very  strong,  DNA- 
binding- dependent  repressor  in  vivo  (Margolin  et  al. 
1994;  Friedman  et  al.  1996).  This  domain  [and  a  mutant, 
KRAB  (DV18  19AA),  which  lacks  repression  activity  and 
fails  to  bind  KAP1]  was  fused  to  the  NH2  terminus  of  the 
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Figure  1.  Characterization  of  chimeric  transcriptional 
repressors.  [A]  Schematic  illustration  of  repressor  pro¬ 
teins.  The  KRAB,  KRAB(DV),  SNAG,  Engrailed,  BTB / 
POZ,  and  WT1  repression  domains  (RDs)  were  fused  in 
frame  with  PAX3-HBD  to  generate  the  RD-PAX3-HBD 
fusion  proteins.  (B)  Immunoprecipitation  of  transfected 
cell  extracts  with  a-PAX3  IgG.  Asterisks  indicate  the 
expressed  proteins.  MW,  molecular  weight  markers.  (C) 
4-OHT-dependent  repression  of  PAX3-luciferase  re¬ 
porter  gene  by  the  chimeric  repressors.  NIH3T3  cells 
were  transfected  with  the  indicated  expression  plas¬ 
mids  and  the  CD19-TK-LUC  reporter  plasmid.  Post¬ 
transfection  and  4-OHT  treatment,  cell  lysates  were  as¬ 
sayed  for  luciferase  and  fi-galactosidase  activities.  Fold 
repression  represents  the  ratio  of  normalized  luciferase 
activity  of  -/+OHT  treated  cells. 


PAX3  DNA  binding  domain  (DBD).  To  make  this  chi¬ 
meric  repressor  hormone  regulatable,  we  fused  the  estro¬ 
gen  receptor  hormone -binding  domain  to  the  COOH  ter¬ 
minus  of  the  PAX3  DBD.  The  ERHBD  contains  G525R 
substitution,  which  renders  it  unresponsive  to  serum  es¬ 
trogens  (Littlewood  et  al.  1995).  These  chimeras  are  in¬ 
active  for  DNA  binding  in  the  absence  of  hormone.  For 
comparison  purposes,  we  also  created  artificial  repres¬ 
sors  using  the  repression  domains  (RD)  from  WT1  (Mad¬ 
den  et  al.  1991),  PLZF-BTB/POZ  (Li  et  al.  1997),  En¬ 
grailed  (Jaynes  and  O'Farrell  1991),  and  GFI-SNAG 
(Zweidler-Mckay  et  al.  1996;  Ayyanathan  et  al.  2000). 
Each  RD-PAX3-HBD  fusion  protein  was  stably  ex¬ 
pressed  in  cells  (Fig.  IB)  and  bound  the  PAX3  recognition 
sequence  (data  not  shown).  However,  only  the  KRAB- 
PAX3-HBD  protein  could  form  a  ternary  complex  with 
the  KAP1  corepressor  and  the  HP1  protein  (data  not 
shown).  Thus,  this  set  of  chimeric  repressors  allows  a 
comparison  of  HPl-mediated  and  HP  1 -independent 
mechanisms  of  gene  silencing. 

4-OHT-dependent  repression  of  a  transient  luciferase 
reporter  template 

The  CD19-TK-LUC-ZeoR  reporter  plasmid  contains  six 
high-affinity  PAX3  recognition  motifs  upstream  of  the 
herpes  simplex  virus  (HSV)  thymidine  kinase  (TK)  pro¬ 
moter.  A  zeocinR  gene  expressed  from  an  SV40  promoter 
was  incorporated  to  provide  both  a  selection  marker  al¬ 
lowing  generation  of  stable  cell  clones  and  a  second  tran¬ 
scription  unit  linked  to  the  luciferase  gene  (Ayyanathan 
et  al.  2000).  The  CD19-TK-LUC-ZeoR  plasmid  showed 
a  high  basal  level  of  luciferase  activity.  We  observed 


4-OHT  dependent  repression  of  the  luciferase  reporter 
(Fig.  1C)  by  each  RD-PAX3-HBD  plasmids.  The  KPHBD 
protein  was  the  most  potent  repressor  (>  10-fold),  while 
the  SPHBD,  EPHBD,  PPHBD,  and  WPHBD  expression 
constructs  elicited  moderate  levels  of  repression  (-3-  to 
6-fold).  The  K(DV)PHBD  protein  was  inactive  for  repres¬ 
sion  activity,  and  none  of  the  RD-PAX3-HBD  proteins 
repressed  a  TK-LUC-ZeoR  reporter,  which  lacked  PAX3 
binding  sites  (data  not  shown).  We  conclude  that  this 
two-plasmid  system  comprises  a  valid,  hormone-  and 
DNA-binding-dependent  repression  model. 

4-OHT-dependent  repression  of  chromatinized 
reporter  transgenes 

To  create  mammalian  cell  lines  with  integrated  lucifer¬ 
ase  reporter  transgenes,  the  DNAs  for  the  CD19-TK- 
LUC-ZeoR  luciferase  reporter  and  each  of  the  RD- 
PAX3-HBD  fusions  were  cotransfected  into  murine 
NIH3T3  fibroblasts  (Fig.  2A).  Following  selection,  clones 
were  expanded,  tested  for  basal  luciferase  activity,  and 
then  tested  for  4-OHT-dependent  repression  activity.  As 
controls,  we  also  generated  stable  cell  lines  that  contain 
only  the  chromatin-integrated  luciferase  reporter  (desig¬ 
nated  as  "CL"  clones).  At  least  48  independent  clonal 
cell  lines  both  for  CL  and  for  each  RD-PAX3-HBD  fu¬ 
sion  were  tested.  A  representative  selection  of  five 
clones  for  each  transfection  is  shown  in  Figure  2B  and  C. 
The  basal  normalized  luciferase  activities,  among 
clones,  varied  from  -103  to  106  light  units/1  O.D.  at  A595 
protein  (Fig.  2B). 

The  CL  clones,  which  do  not  contain  a  stable  RD- 
PAX3-HBD  gene,  showed  no  response  to  4-OHT.  How- 
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Figure  2.  Characterization  of  stable  cell  lines.  (A)  Strategy  to  generate  cell  lines.  (B,C)  4-OHT-dependent  repression  of  chromatinized 
PAX3-luciferase  reporter  gene  in  stable  cell  lines.  First  (top)  graphs  represent  the  normalized  luciferase  activity  (B)  and  fold  repression 
(C)  of  five  independent  "CL"  (luciferase  reporter)  clones.  Second,  third,  and  fourth  graphs  illustrate  the  normalized  luciferase  activity 
( B )  and  fold  repression  (C)  in  clones  stably  expressing  the  respective  RD-PAX3-HBD  proteins.  (D)  Chimeric  repressor  protein  expres¬ 
sion  in  stable  cell  lines.  Gels  illustrate  the  protein  expression  in  KPHBD  (top),  K(DV)PHBD  (middle),  and  SPHBD  ( bottom )  stable  cell 
lines.  Expression  of  full-length  protein  is  indicated  by  in  vitro-translated  protein  (IVT).  (E,F)  Characterization  of  the  KPHBD21  cell  line. 
(E)  Time-  and  4-OHT-concentration-dependent  repression.  KPHBD21  cells  were  treated  with  vehicle  or  varying  concentrations  of 
4-OHT  for  either  0  ,  12  ,  or  24  h.  Fold  repression  of  luciferase  activity  was  determined  as  above.  (F)  4-OHT-dependent  transcriptional 
repression  of  luciferase  gene.  Oligo-dT  primed  first  strand  cDNAs  from  vehicle  or  4-OHT-treated  KPHBD21  cells  were  monitored  for 
luciferase,  neomycinR,  and  zeocinR  transcripts  by  quantitative  PCR.  Arbitrary  Phosphorlmager  units  obtained  from  the  Southern 
analysis  are  plotted.  ■,  -OHT;  •,  +OHT. 


ever,  strong  4-OHT-dependent  repression  was  observed 
in  KPHBD21  cell  line  (~  16-fold)  while  the  KPHBD30, 
KPHBD33,  and  KPHBD36  clones  manifested  moderate 
repression  activity  (~3-  to  5-fold).  In  contrast,  each  of  the 


K(DV)PHBD  transfected  clones  possessed  a  high  basal 
luciferase  activity  that  was  unaffected  by  4-OHT  treat¬ 
ment.  The  SPHBD  transfection  produced  clones  that 
showed  six-  to  sevenfold  repression  in  response  to 
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4-OHT  (Fig.  2C).  Using  metabolic  labeling  and  immuno- 
precipitation  with  a-PAX3  IgG,  each  clone  expressed 
full-length  RD-PAX3-HBD  protein  to  a  level,  which  cor¬ 
related  roughly  with  the  4-OHT-dependent  repression 
potential  of  these  cells  (Fig.  2D).  Stable  cell  lines  con¬ 
taining  an  integrated  luciferase  reporter  and  the 
engrailed-PAX3-HBD  (EPHBD),  PLZF-POZ-PAX3-HBD 
(PPHBD),  or  WT1-PAX3-HBD  (WPHBD)  were  also  gen¬ 
erated,  which  showed  demonstrable  levels  of  4-OHT-de- 
pendent  repression  (data  not  shown).  We  conclude  the 
following:  ( 1 )  4-OHT  treatment  does  not  affect  the  basal 
expression  of  the  reporter  luciferase  transgene  when  sta¬ 
bly  integrated  at  many  different  sites  in  the  genome;  (2) 
the  KPHBD  protein  is  a  powerful,  hormone-dependent 
repressor  of  the  integrated  reporter  transgene;  and  (3) 
NIH3T3  cells  contain  the  machinery  required  to  support 
SNAG  domain  mediated  repression. 

Molecular  characterization  of  a  KRAB-PAX3-HBD 
stable  cell  line 

The  KPHBD21  clone  was  further  characterized.  First,  a 
PCR  using  genomic  DNA  showed  that  the  PAX3  binding 
sites,  the  TK  promoter,  the  luciferase  gene,  and  the  zeo- 
cinR  cassette  were  physically  linked  in  the  integration 
site  (data  not  shown).  Second,  Southern  blotting 
suggested  that  an  estimated  two  to  five  copies  of  the 
CD19-TK-LUC-ZeoR  were  present  (data  not  shown). 
Third,  micrococcal  nuclease  digestion  showed  that  a 
compact  nucleosome  pattern  accompanied  repression 
(data  not  shown).  Fourth,  repression  was  strongly 
time-  and  4-OHT-concentration-dependent  (Fig.  2E). 
Fifth,  4-OHT-dependent  repression  occurred  at  the  level 
of  transcription  as  shown  by  decreases  in  the  abundance 
of  luciferase  mRNA  measured  by  quantitative  reverse 


transcriptase  PCR  (RT-PCR;  Fig.  2F).  However,  tran¬ 
scription  at  both  the  linked  zeocinR  locus  and  the  un¬ 
linked  neomycinR  locus  were  unaffected  by  4-OHT 
treatment  (Fig.  2F).  Thus,  KRAB-mediated  repression  is 
highly  localized,  as  a  linked  transcription  unit  (zeocinR 
cassette)  -2.8  kbp  away  from  the  repressor  binding  site  is 
unaffected. 

KRAB-PAX3-HBD  protein  induces  a  highly  localized 
compact  chromatin  structure 

To  begin  to  dissect  the  changes  in  chromatin  structure 
that  accompany  silencing,  nuclease  accessibility  assays 
were  performed  using  known  restriction  enzyme  sites 
in  the  CD19-TK-LUC-ZeoR  plasmid.  As  shown  in  Fig¬ 
ure  3A,  each  enzyme  showed  dramatic  inhibition  of 
cleavage  in  the  nuclei  derived  from  4-OHT-treated 
nuclei.  A  region  of  257  bp  flanking  the  TK  promoter 
and  transcription  initiation  site  in  the  luciferase  gene 
is  converted  to  a  compact,  nuclease-resistant  struc¬ 
ture  upon  4-OHT  treatment.  Strong  cleavage  was  ob¬ 
served  at  both  the  Ncol  and  Stul  sites  in  the  presence  of 
4-OHT  at  the  unlinked  neomycinR  gene  (Fig.  3B).  Similar 
sites  in  the  linked  SV40  promoter-zeocinR  gene  also 
showed  equal  accessibility  regardless  of  4-OHT  treat¬ 
ment  (Fig.  3C).  Thus,  the  compact  chromatin  structure 
induced  by  KPHBD  binding  to  the  PAX3  sites  near  the 
TK  promoter  is  not  established  at  a  promoter  2.8  kbp 
distant. 

Recruitment  of  KAP1  and  HP1  to  the  reporter 
luciferase  transgene 

We  performed  comprehensive  chromatin  immunopre- 
cipitation  (ChIP)  analyses  of  the  integrated  locus  and  the 
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Figure  3.  Differential  endonuclease  accessibility  ob¬ 
served  only  at  the  luciferase  locus.  [A]  Restriction  en¬ 
donuclease  sensitivity  at  the  TK  promoter-luciferase 
locus.  Nuclei  isolated  from  vehicle-  (-OHT)  or  4-OHT 
(+OHT)-treated  KPHBD21  cells  were  treated  with  the 
indicated  restriction  endonucleases.  Purified  DNA  was 
used  in  reiterative  primer-extension  PCR  with  LUC1 
primer.  The  denatured  products  were  resolved  on  Urea- 
PAGE  and  autoradiographed.  (B,C)  Restriction  endo¬ 
nuclease  sensitivity  at  the  neomycinR  [B]  and  zeocinR 
(C)  loci  monitored  by  using  NE02  and  ZE02  primers. 
MW,  radiolabeled  <|>X174  Hae III  digest;  nt,  nucleotide 
length;  PAX3BS,  PAX3-binding  site. 
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results  are  presented  in  Figure  4.  Using  antibodies  spe¬ 
cific  to  the  PAX3,  KAP1,  HPla  and  HPly  proteins,  the 
DNA  recovered  was  analyzed  by  quantitative  PCR  using 
primer  pairs  for  regions  of  the  CD19-TK-LUC,  zeocin 
loci  (Fig.  2A).  Fragments,  which  bracket  the  PAX3  bind¬ 
ing  sites,  were  strongly  enriched  (5-10-fold)  in  the  PAX3 
immunoprecipitates  (IPs)  after  4-OHT-treatment.  Other 
components  of  the  KRAB  repression  complex  (i.e.,  KAP1, 
HPla,  HPl-y)  were  inducibly  recruited  to  the  target  gene. 
This  enrichment  was  most  evident  for  the  primer  pair 
that  directly  flanks  the  PAX3  binding  sites  (PBS1  and 
PBS2).  This  fragment  is  enriched  10-,  1 1  -,  and  -5-fold  in 
PAX3,  KAP1,  and  HPla  IPs,  respectively.  No  signal  (NS) 
was  detected  from  the  CL2  reporter  cell  line,  which  lacks 
a  KPHBD  repressor  expression.  The  257  bp  fragment 
spanning  the  TK  promoter  and  transcription  initiation 
site  is  most  highly  enriched  in  KAP1  and  HPla  IPs,  10- 
and  27-fold,  respectively.  This  is  different  from  HPly, 
which  is  constitutively  bound  to  that  region  and  not  en¬ 
riched  upon  4-OHT-treatment.  No  signal  was  obtained 
for  the  3'  end  of  the  luciferase-coding  region  (located  1.2 
kbp  downstream  from  the  TK  promoter;  primers  LUC2 
and  LUC3),  the  SV40-zeocinR  cassette,  or  the  unlinked 
SV40-neomycinR  cassette  (data  not  shown).  Thus,  re¬ 
cruitment  of  KAP1  and  HPla  is  highly  specific  for  a  tar¬ 
get  gene  regulated  by  a  DNA-bound  KRAB  repression 
domain.  Moreover,  the  association  of  KAP1  and  HPla  to 
the  chromatin  occurs  in  a  highly  localized  region. 


Recruitment  of  KAP1  and  HP1  to  an  endogenous 
target  gene 

To  determine  whether  similar  physical  characteristics 
occur  at  an  endogenous  target  gene  that  is  silenced  by  a 


naturally  occurring  KRAB  zinc  finger  protein,  we  studied 
the  NT2  KRAB-ZFP  (Fig.  5A).  NT2  protein  is  abundant 
in  NIH3T3  cells  (Fig.  5B),  binds  to  a  24-bp  consensus 
sequence  in  the  Collla2  promoter,  and  silences  it 
(Tanaka  et  al.  2002;  Fig.  5C).  NT2  is  highly  transcribed 
while  the  Collla2  transcript  is  undetectable  suggesting 
that  NT2  is  contributing  to  Collla2  silencing  in 
NIH3T3  cells.  However,  the  tightly  linked  RXRfi  gene, 
which  is  present  5'  of  Collla2  is  highly  expressed  sug¬ 
gesting  that  NT2  mediated  silencing  is  not  long  range 
(Fig.  5C,D).  Moreover,  ChIP  experiments  demonstrated 
NT2  at  the  predicted  binding  site  but  not  enriched  in  the 
proximal  promoter  region.  However  KAP1,  HPla,  HPly, 
and  SETDB1  proteins  were  abundantly  present  at  both 
regions.  The  promoter  region  was  also  highly  enriched  in 
H3-MeK9  (Fig.  5E).  These  results  were  strikingly  similar 
to  those  observed  with  the  integrated  transgene  and 
strongly  suggest  that  a  localized  heterochromatin  struc¬ 
ture  is  generated  at  an  endogenous  locus  silenced  by  a 
KRAB-ZFP. 


Spatial  relocalization  of  the  luciferase  transgene 
to  condensed  chromatin 

We  performed  fluorescence  in  situ  hybridization  (FISH) 
analysis  on  the  KPHBD21  cell  line  using  a  probe  for  the 
CD19-TK-LUC  luciferase  reporter  plasmid,  and  the 
cells  were  counter-stained  with  either  DAPI  or  Hoechst 
dyes  to  identify  condensed  heterochromatin  structures 
(Fig.  6A,B).  A  single  locus  was  observed  in  the  KPHBD21 
line  in  every  cell  examined,  strongly  suggesting  a  single 
integration  site  for  the  reporter  plasmid  (data  not 
shown).  In  the  absence  of  4-OHT,  the  FISH  signals  were 
most  frequently  found  to  be  spatially  distinct  from  the 


Figure  4.  KRAB-box-dependent  recruitment  of  KAP1 
and  HP1  proteins  to  the  integrated  luciferase  transgene. 
CL2  and  KPHBD21  cells  were  treated  with  either  ve¬ 
hicle  (-OHT)  or  4-OHT  (+OHT),  and  the  DNA-protein 
complexes  were  chemically  cross-linked  in  vivo  with 
formaldehyde.  Soluble,  sonicated  chromatin  was  im- 
munoprecipitated  with  a-PAX3,  ct-KAPl,  a-HPla,  and 
a-HPly  antibodies.  Quantitative  PCR  was  carried  out 
with  input  and  immunoprecipitated  DNAs  using  spe¬ 
cific  primer  pairs  indicated  in  Figure  2A.  PCR-amplified 
DNA  fragments  are  depicted  from  ChIPs  with  a-PAX3, 
a-KAPl,  a-HPla,  and  a-HPly  IgG.  Bold  vertical  lines 
on  the  right  indicate  the  relative  positions  of  the  trans¬ 
gene  fragments  amplified.  The  numbers  in  parentheses 
indicate  fold  enrichment,  and  NS  denotes  no  signal. 
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Figure  5.  NT2-KRAB  zinc  finger  protein  stably  re¬ 
presses  the  expression  of  endogenous  Collla2  gene 
in  NIH3T3  cells.  [A]  Diagrammatic  representation 
of  the  mouse  NT2-KRAB  zinc  finger  protein.  The 
positions  of  SCAN,  KRAB  domains,  ZF  motifs,  and 
the  antigen  used  for  a-NT2  antibody  production  are 
indicated.  [B]  Abundant  expression  of  NT2  protein 
in  NIH3T3  cells.  Nuclear  proteins  (100  pg)  were 
electrophoresed  on  a  10%  SDS-PAGE  and  immuno- 
blotted  with  affinity-purified  a-NT2  antibody.  The 
-75  kD  NT2  protein  is  indicated  by  an  arrow  and  a 
nonspecific  band  is  marked  by  an  asterisk.  (C)  Ge¬ 
nomic  structure  of  RXRfi  and  Collla2  genes.  The 
Collla2  gene  is  organized  in  a  head-to-tail  orienta¬ 
tion  with  RXRfi.  The  relative  locations  of  oligo¬ 
nucleotides  used  in  ChIP-PCR  experiments  and  the 
fragment  sizes  are  indicated.  El  to  E37  represent  the 
exons  of  Collla2  gene.  (D)  Expression  patterns  of 
NT2,  Collla2,  and  RXR |3  transcripts  in  NIH3T3 
cells.  Oligo-dT-primed  first-strand  cDNAs  were  am¬ 
plified  by  quantitative  PCR  using  primer  pairs  spe¬ 
cific  for  the  NT2,  Collla2,  and  RXR(3  transcripts 
and  electrophoresed  on  1.5%  agarose  gel.  (£)  Com¬ 
ponents  of  KRAB  repression  machinery  are  enriched 
at  the  silenced  Collla2  locus.  Cross-linked  chroma¬ 
tin  was  immunoprecipitated  with  the  indicated  an¬ 
tibodies  and  the  bound  DNA  was  analyzed  by  PCR 
using  primers  indicated  in  C. 


condensed  chromatin  islands  as  single  green  dots.  How¬ 
ever,  following  addition  of  4-OHT,  there  was  a  clear  as¬ 
sociation  of  the  majority  of  luciferase  FISH  signals  with 
the  condensed  chromatin  territories.  We  quantitated  this 
pattern  by  analyzing  many  cells  (Fig.  6C).  Using  HPla 
antibodies,  which  stained  the  heterochromatic  islands 
detected  by  the  DNA  dyes,  4-OHT-dependent  spatial  re¬ 
cruitment  of  the  luciferase  FISH  signals  to  the  HP  1 -rich 
regions  was  also  observed  (data  not  shown).  Thus,  spatial 
relocalization  of  the  reporter  gene  appears  to  accompany 
KRAB-KAPl-HPl-dependent  silencing  in  this  system. 

Transient  exposure  of  the  luciferase  reporter  transgene 
to  KRAB-PAX3-HBD  induces  stable  silencing 

A  hallmark  property  of  HP  1 -dependent  gene  silencing  is 
that  the  silenced  state  is  stably  maintained  through 
many  cell  divisions  (Lu  et  al.  1998).  To  determine  if  there 
is  a  stable  component  to  KRAB-KAPl-HPl-mediated 
gene  silencing  in  the  KPHBD21  cell  line,  we  performed 
4-OHT  washout  experiments.  We  took  advantage  of  the 
fact  that  4-OHT  is  readily  removed  from  cells  and  that 
its  effects  on  HBD  fusion  proteins  are  readily  reversible 
(Littlewood  et  al.  1995;  Pelengaris  et  al.  1999).  A  24-h, 
4-OHT  treatment  produced  ~24-fold  reduction  in  lucif¬ 


erase  activity  (Fig.  7A;  filled  bars).  Twenty-four  hours 
after  hormone  removal,  the  luciferase  activity  was  only 
two-  to  threefold  lower  than  a  duplicate  dish,  which  had 
not  been  treated  with  4-OHT,  thus  showing  substantial 
reversal  of  the  repressed  state  and  verifying  that  the  hor¬ 
mone  was  removed.  However,  dishes  harvested  at  2,  3, 
and  4  d  after  4-OHT  removal  still  maintained  a  substan¬ 
tially  lower  basal  luciferase  level  than  an  untreated  du¬ 
plicate  dish.  The  hormone  had  no  effect  on  cell  growth 
rate  and  the  cells  went  through  about  four  rounds  of  cell 
division  during  the  4-d  time  period.  This  result  was  even 
more  pronounced  when  the  initial  4-OHT  treatment  was 
for  48  h  (Fig.  7 A,  patterned  bars):  Dishes  maintained  in 
the  absence  of  hormone  for  4  d  showed  approximately 
sixfold  lower  basal  luciferase  activity  compared  to  un¬ 
treated  dishes.  This  result  suggested  that  gene  silencing 
might  be  stably  maintained  in  a  subpopulation  of  cells. 

To  further  explore  this  phenomenon,  we  performed 
longer-term,  4-OHT  treatment  and  washout  studies  (Fig. 
7B).  A  2-,  4-,  or  6-d  4-OHT  treatment  of  KPHBD21  cells 
strongly  repressed  the  reporter  luciferase  activity  (>50- 
fold  reduction  in  luciferase  activity  at  6  d).  However, 
following  4-OHT  removal,  activity  was  substantially, 
but  never  completely,  recovered.  This  effect  was  still 
evident  at  8  d  post-4-OHT  removal,  a  time  course  that 
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Figure  6.  4-OHT-dependent  association  of  the  lucifer- 
ase  transgene  with  condensed  chromatin  regions.  The 
KPHBD21  cells  were  seeded  onto  glass  coverslips  and 
treated  with  either  vehicle  (-OHT)  or  4-OHT  (+OHT). 
FISH  was  carried  out  using  luciferase  probe  and  the 
nuclear  DNA  was  visualized  by  staining  with  either 
DAPI  (A)  or  Hoechst  (B).  The  number  of  cells  analyzed 
for  each  condition  is  indicated  (C).  The  results  are  pre¬ 
sented  as  the  percent  of  FISH  signals  that  were  either 
associated  (whitish  green  dots)  or  not  associated  (green 
dots)  with  the  DAPI-  or  Hoechst-stained  spots. 
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included  extensive  washing  of  each  dish  every  day,  and 
included  a  trypsinization  and  replating  at  semiconfluent 
cell  densities.  No  repression  was  observed  in  two  cell 
clones  containing  KRAB(DV)-PAX3-HBD  fusion  protein 
(Fig.  7C).  Moreover,  though  substantial  repression  was 
observed  in  each  of  two  SNAG-PAX3-HBD  cell  clones 
after  a  4-d  hormone  treatment,  the  luciferase  activity 
was  fully  recovered  following  4-OHT  removal  (Fig.  7D). 

KRAB-KAP1-HP1  mediated  gene  silencing  is 
mitotically  heritable  over  many  cell  generations 

HP1 -dependent  gene  variegation  reflects  the  ability  to 
maintain  gene  silencing  over  many  cell  generations  in  a 
clone  of  cells.  To  determine  if  a  similar  effect  occurred  at 
a  euchromatic  gene  bound  by  HP1,  we  performed  lucif¬ 
erase  activities  in  cell  populations  that  were  transiently 
pulsed  with  4-OHT.  Three  subclones  of  the  KPHBD21 
cell  line,  KPHBD21-08,  KPHBD21-39,  and  KPHBD21-49, 
were  further  studied.  Duplicate  dishes  of  cells  were 
treated  with  4-OHT  or  vehicle  for  4  d,  then  washed,  and 
subjected  to  a  limiting  dilution  cloning.  Clones  were  ex¬ 
panded  in  the  absence  of  4-OHT  for  -40  population  dou¬ 
blings  and  tested  for  basal  luciferase  activity  (Fig.  8A). 
For  KPHBD21-08,  69  untreated  subclones  were  isolated: 
their  normalized  luciferase  activities  showed  a  mean  of 
~105  light  units/O.D.  of  protein  and  varied  <7-fold  from 
lowest  to  highest.  However,  subline  KPHBD21-08  cells 
that  had  received  4-OHT  prior  to  single  cell  cloning 
yielded  a  set  of  subclones  with  a  dramatic  skewing  of 
activities.  More  than  one-third  of  65  clones  yielded  basal 
luciferase  activities  substantially  lower  than  the  lowest 
subclone  derived  from  the  untreated  population  (Fig.  8B). 


Remarkably,  a  portion  of  the  clones  showed  barely  de¬ 
tectable  luciferase  activities. 

A  statistical  analysis  using  Fisher's  test  (F-test),  which 
compares  the  variances  of  two  samples,  yielded  a  highly 
significant  F  value  of  0.00000003.  Identical  results  were 
obtained  with  subclones  KPHBD21-39  and  KPHBD21- 
49,  where  the  F  values  comparing  luciferase  activity  in 
untreated  and  treated  populations  were  F  =  0.00074  and 
F  =  0.00014,  respectively  (Fig.  8B).  The  subclones  main¬ 
tain  resistance  to  zeocin  and  G418  and  thus  have  not 
deleted  the  transgenes.  Moreover,  the  stably  repressed 
clones  have  maintained  the  silent  state  for  >10  mo  in 
culture  (data  not  shown).  These  results  strongly  suggest 
that  transient  recruitment  of  the  KRAB-KAP1-HP1 
complex  to  a  euchromatic  gene  produces  a  silenced  state 
that  is  mitotically  heritable  and  does  not  require  a  per¬ 
sistent  DNA-binding  component. 

Molecular  characteristics  of  the  target  locus  in  the 
silenced  and  expressed  clones 

To  understand  the  molecular  basis  for  this  stable  silenc¬ 
ing,  we  characterized  the  physical  state  of  the  silent  and 
expressed  luciferase  genes.  We  selected  silent  (cl  39-40) 
and  expressed  (cl  39-45)  subclones  whose  normalized  lu¬ 
ciferase  activities  were  three  orders  of  magnitude  differ¬ 
ent  (Fig.  9 A,  top  panel).  Using  ChIP,  we  observed  a  strong 
enrichment  of  KAP1  (11.66-fold),  SETDB1  (10.33-fold), 
HPla  (6.93-fold),  H3-MeK9  (5.3-fold),  and  a  moderate  en¬ 
richment  of  HPly  (2.43-fold)  proteins  in  the  silent  clone 
compared  to  the  expressed  clone  (Fig.  9 A,  bottom  panel). 
However,  the  level  of  the  KRAB-PAX3-HBD  component 
at  the  transgene  was  very  low  and  similar  in  the  silent 
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Figure  7.  The  KPHBD  protein  induces  stable  repression  of  the 
luciferase  transgene.  [A]  Transient  4-OHT  treatment  of 
KPHBD21  cells  induces  a  stable  silencing  component.  Dupli¬ 
cate  dishes  of  KPHBD21  cells  were  treated  with  either  vehicle 
or  4-OHT  (+OHT)  continuously  for  24  or  48  h  and  washings 
performed.  Pairs  of  dishes  (-/+OHT)  were  harvested  at  the  in¬ 
dicated  time  points  post  4-OHT  removal  and  fold  repression 
was  determined.  Filled  bars,  24-h  4-OHT  treatment;  patterned 
bars,  48-h  4-OHT  treatment.  (B)  A  longer  4-OHT  treatment  of 
KPHBD21  cells  can  lead  to  a  larger  stable  silencing  component. 
Duplicate  dishes  of  KPHBD21  cells  were  treated  with  either 
vehicle  or  4-OHT  continuously  for  2,  4,  or  6  d.  Washings  and 
replatings  were  performed  as  described  in  Materials  and  Meth¬ 
ods.  Pairs  of  dishes  (-/+OHT)  were  harvested  every  day  and  fold 
repression  was  determined.  ♦ ,  2  d  of  4-OHT  treatment;  •,  4  d 
of  4-OHT  treatment;  ■,  6  d  of  4-OHT  treatment.  (C)  4-OHT - 
induced  repression  is  not  observed  in  K(DV)PHBD  clones.  The 
K(DV)PHBD3  (O)  and  K(DV)PHBD22  {•)  clonal  cell  lines  were 
treated  with  vehicle  or  4-OHT  continuously  for  4  d,  washed 
extensively,  and  fold  repression  determined.  (D)  4-OHT-in- 
duced  silencing  component  is  not  observed  in  SPHBD  clones. 
The  SPHBD11  (•)  and  SPHBD20  (O)  clonal  cell  lines  were 
treated  with  vehicle  or  4-OHT  continuously  for  4  d,  washed 
extensively,  and  fold  repression  determined. 


and  active  clones.  To  verify  that  the  PAX3  DNA  binding 
sites  are  present  and  accessible  in  the  silent  clone,  we 
transfected  a  VP16-PAX3  plasmid  that  functions  as  a 
powerful  activator.  We  observed  a  dose-dependent  acti¬ 
vation  of  the  silent  luciferase  locus  (Fig.  9B).  These  re¬ 
sults  suggest  that  the  molecular  components  of  the 
KRAB-KAP1-HP1  repression  pathway  are  constitutively 
present  at  the  silenced  locus  and  also  that  the  mainte¬ 
nance  of  the  silent  state  is  not  the  result  of  persistent 
DNA  binding  of  the  KRAB-PAX3-HBD  protein. 

To  further  characterize  the  silent  clone,  we  treated  the 
cells  with  5-azacytidine  (5AC)  and  trichostatin-A  (TSA). 
Treatment  with  either  5AC  or  TSA  alone  did  not  signifi¬ 
cantly  reactivate  the  locus  (Fig.  9C).  However,  sequential 
treatment  with  5AC  followed  by  TSA  was  highly  syner¬ 
gistic  in  reactivating  the  silent  locus  (Fig.  9C)  suggesting 


that  the  silent  state  may  be  maintained  as  a  result  of 
DNA  methylation  (Cameron  et  al.  1999).  To  confirm 
this,  we  determined  the  methylation  status  of  the  TK 
promoter  regions  of  the  silent  and  active  clones  by  meth- 
ylation-specific  PCR  (MSPCR;  Fig.  9D;  Herman  et  al. 
1996).  The  methylation-specific  primer  set  (M)  generated 
a  strong  amplification  product  using  DNA  from  the  si¬ 
lent  clone.  However,  no  product  was  seen  using  DNA 
from  the  active  clone.  Conversely,  the  unmethylated 
specific  primers  (U)  only  produced  a  product  from  the 
active  clone.  These  results  suggest  that  the  silent  and 
active  clones  have  different  CpG  methylation  profiles  at 
the  promoter. 

To  map  regions  of  methylation,  we  sequenced  the 
cloned  PCR  products  derived  from  bisulfite-treated  ge¬ 
nomic  DNAs  (Fig.  9E).  The  active  clone  showed  nonran¬ 
dom  CpG  methylation  that  was  highly  restricted  to  the 
region  immediately  surrounding  the  transcription  start 
site.  However,  the  silent  clone  showed  dramatically  en- 
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Figure  8.  Clonal  analysis  of  luciferase  gene  expression  in 
KPHBD21  cells.  [A]  An  experimental  scheme  designed  to  mea¬ 
sure  variegated  expression  of  the  luciferase  gene  in  KPHBD21 
cells.  (B)  Normalized  luciferase  activities  for  single  cell  progeny 
derived  from  KPHBD21-8,  KPHBD21-39,  or  KPHBD21-49.  Each 
□  and  <>  represents  an  independent  single-cell  subclone  of  the 
parental  line  treated  with  either  vehicle  (-OHT)  or  4-OHT 
(+OHT),  respectively.  The  total  number  of  single-cell  subclones 
analyzed  for  each  parental  line  is  indicated  at  the  bottom.  Ar¬ 
rows  denote  the  silent  (cl  39-40)  and  active  (cl  39-45)  clones 
studied  in  detail. 
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Figure  9.  Molecular  characteristics  of  the  active  and  silent 
clones.  ( A )  Prior  to  ChIP  experiments,  the  silent  (cl  39-40)  and 
active  (cl  39-45)  clones  were  tested  for  their  basal  luciferase 
activities  ( top  panel).  Next,  the  cross-linked  chromatin  frac¬ 
tions  obtained  from  silent  (cl  39-40)  and  active  (cl  39-45) 
clones  were  used  in  ChIPs  with  anti-PAX3,  KAP1,  SETDB1, 
HPla,  HPl-y,  and  MeK9  antibodies.  The  input  and  the  immu- 
noprecipitated  DNAs  were  amplified  using  TKP1  and  LUC1 
primers  (Fig.  2A).  Fold  difference  was  determined  by  Phospho- 
rlmager  analysis  of  the  Southern  blots  ( bottom  panel).  [B]  In¬ 
dicated  concentrations  of  VPDBD  plasmid  was  transiently 
transfected  into  the  silent  clone  (cl  39-40),  the  cell  lysates 
were  assayed  for  luciferase  activities  and  normalized  with 
(3-galactosidase  values.  The  filled  and  patterned  bars  represent 
results  obtained  from  two  independent  experiments.  (C)  The 
silent  (cl  39-40)  and  active  (cl  39-45)  clones  were  plated  in 
multiple  dishes  and  treated  with  the  indicated  chemicals  for 
specified  durations.  The  filled  and  patterned  bars  represent 
the  luciferase  activity  obtained  from  duplicate  dishes.  (D) 
MSPCR.  Genomic  DNAs  extracted  from  the  silent  (cl  39-40) 
and  active  (cl  39 — 45)  clones  were  treated  with  sodium  bisulfite 
and  amplified  using  unmethylated  sense  (US)  or  methylated 
sense  (MS)  primers  and  an  antisense  primer  (UMA1)  belonging 
to  the  TK  promoter  indicated  in  E  and  shown.  ( E )  Sodium 
bisulfite-genomic  sequencing.  The  bisulfite-modified  genomic 
DNAs  of  the  silent  (cl  39-40)  and  active  (cl  39-45)  clones  were 
PCR-amplified  using  UMS1  (sense)  and  UMA1  (antisense) 
primers.  The  PCR  products  were  TA-cloned  into  pCR  II  vector 
(Invitrogen).  A  representative  selection  of  clones  from  both  cl 
39-40  and  cl  39-45  were  sequenced  and  the  results  presented. 
Hanging  0 ,  the  positions  of  CpG  residues;  □,  the  unmethyl¬ 
ated  CpG;  B  methylated  CpG  residues.  Arrows  pointing  up 
mark  the  CpG  residues  that  are  preferentially  methylated  in 
the  genomic  clones  of  cl  39-40. 
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hanced  density  of  CpG  methylation  that  was  spread  both 
5'  and  3'  of  the  transcription  start  site.  A  total  of  12  new 
CpG  sites  were  methylated  in  the  silenced  clone  (Fig. 
9E).  Together,  these  results  suggest  that  DNA  methyl¬ 
ation  may  contribute  to  the  mitotically  heritable  gene 
silencing  we  observe. 

Discussion 

Until  recently,  most  of  our  insights  into  HP1  protein 
function  and  the  role  of  heterochromatin  in  gene  regu¬ 
lation  have  come  from  the  study  of  position  effect  var¬ 
iegation  (PEV)  in  Drosophila.  Some  of  the  key  principles 
established  include  the  following:  pericentromeric  and 
telomeric  heterochromatin  is  enriched  in  DNA  repeats 
that  are  packaged  in  a  compact,  chromatin  structure. 


These  regions  have  a  low  gene  density,  are  late-replicat¬ 
ing,  and  appear  highly  suppressed  for  both  recombina¬ 
tion  and  transcription  (Wallrath  1998).  The  large  blocks 
of  constitutive  heterochromatin  are  highly  enriched  in 
the  HP1  protein,  which  can  function  as  exquisite  dose- 
sensitive  silencers  of  an  adjacent  euchromatic  transgene. 
Once  established,  this  silencing  is  mitotically  heritable 
through  many  cell  divisions  giving  rise  to  the  classic 
variegated  gene  expression  phenotype  in  the  adult  or¬ 
gans.  Whether  these  principles  will  hold  true  in  higher 
metazoans  is  a  key  question. 

Recent  evidence  suggests  a  broader  role  for  HP1  and 
heterochromatin  in  mammalian  gene  regulation.  First, 
the  human  and  mouse  genomes  contain  at  least  three 
HP1  homologs,  some  of  which  display  constitutive  eu¬ 
chromatic  localization  (Ryan  et  al.  1999;  Mine  et  al. 
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2000) .  Second,  heterodimerization  can  occur  among 
these  HP1  isoforms  (Nielsen  et  al.  2001).  Third,  there  are 
a  host  of  regulatory  proteins,  which  target  HP1  to  spe¬ 
cific  loci  (Lechner  et  al.  2000;  Smothers  and  Henikoff 

2001) .  Finally,  there  is  distinct  tissue-,  cell-type  specific¬ 
ity  of  the  HP1  isoforms  including  an  emerging  role  in 
tumor  progression  (Kirschmann  et  al.  2000). 

In  this  article,  we  have  established  some  of  the  first 
principles  for  the  mechanism  of  action  of  mammalian 
HP1  at  a  euchromatic  locus.  The  KRAB-KAP1  repres¬ 
sion  system  we  used  is  a  physiologically  relevant  target¬ 
ing  mechanism  for  HP1  proteins  and,  in  addition,  coor¬ 
dinates  two  other  activities  that  are  likely  to  be  required 
for  silencing:  recruitment  of  the  NuRD-HDAC  complex 
(Schultz  et  al.  2001)  and  recruitment  of  the  methyltrans- 
ferase,  SETDB1  (Schultz  et  al.  2002).  Our  results  support 
the  following  conclusions:  (1)  The  KRAB  domain  is  able 
to  coordinate  machinery  for  strong  transcriptional  re¬ 
pression  of  an  integrated,  chromatinized  RNA  pol  II  tran¬ 
scribed  target  gene,-  (2)  repression  is  relatively  short- 
range,-  (3)  repression  is  accompanied  by  a  highly  localized 
chromatin  compaction;  (4)  the  KAP1  corepressor,  and 
HPla/y  proteins  are  physically  associated  with  the  re¬ 
pressed  gene  in  a  highly  localized  manner;  (5)  the  in¬ 
duced,  silent  state  of  the  transgenic  reporter  is  mitoti- 
cally  heritable  in  the  absence  of  hormone  for  at  least  40 
population  doublings;  and  (6),  stable  silencing  is  appar¬ 
ently  specific  for  the  KRAB-KAP1-HP1  mechanism  as 
other  repression  domain  fusions  (which  do  not  bind  HP1 ) 
do  not  induce  a  heritable,  silenced  state.  Together,  these 
results  strongly  suggest  that  we  have  established  a  model 
for  HP1 -dependent  silencing  and  variegation  of  a  euchro¬ 
matic  gene  in  a  mammalian  cell  line. 

A  critical  issue  for  interpretation  of  these  results  is  the 
chromatin  environment  of  the  integrated  reporter  trans¬ 
gene.  We  strongly  favor  the  designation  of  the  luciferase 
reporter  as  euchromatic  based  upon  the  following  func¬ 
tional  criteria:  (1)  All  clones  show  a  high  basal  level  of 
luciferase  activity,-  (2)  all  clones  both  express,  and  show 
physical  linkage  to  the  zeocin  resistant  cassette;  (3)  the 
reporter  genes  are  packaged  into  a  nuclease-accessible 
chromatin  structure,-  and  (4)  interphase  FISH  shows  the 
expressed  reporter  transgene  present  in  euchromatic 
chromosome  territories.  Thus,  while  designation  of  a 
gene  as  euchromatic  is  historically  based  upon  cytologi- 
cal  analyses,  clearly,  the  reporter  plasmids  have  inte¬ 
grated  into  a  region  permissive  for  a  high  level  of  expres¬ 
sion. 

The  KRAB  box  functions  as  a  short  range  repressor 

A  clear  result  with  this  system  is  that  the  mechanism  of 
KRAB-mediated  repression  appears  to  function  over  rela¬ 
tively  short  distances  in  chromatin.  This  property  of  a 
repression  domain,  that  is,  the  ability  to  mediate  long- 
range  versus  short-range  repression  has  emerged  as  a  key 
determinant  of  the  biological  function  for  a  repressor 
protein  (Gray  and  Levine  1996).  KRAB-mediated,  short- 
range  repression  was  accompanied  by  highly  localized 
chromatin  compaction  and  a  physical  association  of  the 


KPHBD  protein  at  the  PAX3  DNA  recognition  se¬ 
quences.  The  KAP1  and  HP1  proteins  were  enriched  at 
the  TK  promoter.  Remarkably,  the  physical  association 
of  these  proteins  appears  to  only  span  a  few  nucleosomes 
because  a  region  -1.2  kbp  away  was  devoid  of  cross-link¬ 
able  protein.  Such  a  short-range  repressor  property  is  also 
exhibited  by  the  retinoblastoma  (Rb)  repressor  protein, 
which  has  been  shown  to  remodel  a  single  nucleosome 
at  the  promoter  region  (Morrison  et  al.  2002).  In  light  of 
these  results,  it  is  remarkable  that  we  also  detect  spatial 
relocalization  of  the  gene  in  the  interphase  nucleus  to  a 
block  of  constitutive  heterochromatin.  Intuitively,  one 
would  think  that  such  a  long-range  relocalization  adja¬ 
cent  to  a  large  block  of  A-T-rich  condensed  chromatin 
would  have  long-range  silencing  effects  on  the  gene. 
While  one  caveat  to  this  result  is  the  potential  presence 
of  a  cryptic  insulator/boundary  element  between  the  lu¬ 
ciferase  and  zeocin  genes,  we  favor  the  notion  that  si¬ 
lencing  mediated  by  a  short-range  repressor  may  occur 
by  highly  localized  looping  into  heterochromatic  envi¬ 
ronments  (Seum  et  al.  2001). 

KRAB-KAP1-HP1  -mediated  gene  silencing  is 
mitotically  heritable 

The  most  striking  finding  in  this  study  is  that  transgene 
silencing  is  apparently  mitotically  heritable  in  cell  cul¬ 
ture.  This  is  the  hallmark  property  that  distinguishes 
HP  1 -dependent  PEV  from  other  mechanisms  of  repres¬ 
sion.  Instead  of  physical  linkage  via  translocation  of  the 
euchromatic  gene  to  adjacent  heterochromatin  as  occurs 
in  PEV,  we  have  relied  on  transient  targeting  of  HP1  to  a 
transcribed  locus  using  the  KRAB-KAP1  system.  Our  re¬ 
sults  suggest  that  a  pulse  of  KRAB-KAP-HP1  protein 
induces  a  stable,  silenced  state  that  can  be  detected  at 
high  frequency  in  clonal  subpopulations  following 
growth  of  >40  cell  generations  in  the  absence  of  hor¬ 
mone.  Because  the  biological  effects  of  HBD  fusion  pro¬ 
teins  are  readily  reversible  following  removal  of  hor¬ 
mone,  we  conclude  that  the  stable  silencing  we  observe 
is  being  maintained  in  the  absence  of  the  KPHBD  DNA 
binding  activity.  This  is  further  supported  by  the  obser¬ 
vation  that  SNAG  domain-mediated  repression  (which 
does  not  involve  HP1)  is  completely  reversible  by  hor¬ 
mone  withdrawal  under  the  same  experimental  condi¬ 
tions. 

A  key  observation  is  that  silencing  did  not  appear  to 
spread  along  the  template  into  the  adjacent  zeocin  tran¬ 
scription  unit.  Spreading  is  commonly  observed  in  peri- 
centromeric  heterochromatin-mediated  PEV  in  Dro¬ 
sophila.  Because  we  continually  selected  for  zeocin  ex¬ 
pression  (via  drug  selection)  during  growth  of  the  single¬ 
cell  clones  after  the  4-OHT-pulse,  this  selection  pressure 
may  have  both  impeded  spreading  and  the  establishment 
of  a  larger  domain  of  silencing.  Therefore,  we  repeated 
the  clonal  analysis  in  the  absence  of  zeocin  to  determine 
if  a  higher  frequency  of  stably  silenced  clones  could  be 
observed.  However,  the  same  frequency  of  stably  si¬ 
lenced  clones  was  observed  in  the  absence  of  zeocin 
when  compared  to  the  +zeocin  experiment  (data  not 
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shown).  We  conclude  that  silencing  of  euchromatic 
genes  via  localized  recruitment  of  HP1  may  be  unique  to 
KRAB  repressor  domain  and  also  that  KRAB  is  funda¬ 
mentally  different  from  the  other  repressor  domains  in 
this  property. 

HP1  mediated  silencing  of  euchromatic  genes 

Our  study  is  one  of  the  first  to  document  the  physical 
characteristics  of  a  variegating  euchromatic  gene  re¬ 
pressed  by  HP1  in  a  mammalian  cell.  Previously,  a  mu¬ 
rine  CD2  transgene  reporter  was  shown  to  variegate  in 
the  thymus  in  response  to  HP1  dose  (Festenstein  et  al. 
1999).  Interestingly,  variegation  could  be  either  en¬ 
hanced  or  repressed  by  HP1  dose,  depending  upon  inte¬ 
gration  site  (Festenstein  et  al.  1999).  Recently,  a  set  of 
apparently  euchromatic  Drosophila  genes  were  identi¬ 
fied  that  may  be  targets  for  dose-dependent  HP1  regula¬ 
tion  and  can  be  affected  by  known  modifiers  of  Dro¬ 
sophila  PEV  (Hwang  et  al.  2001).  However,  it  is  unclear 
if  they  variegate  and/or  what  the  nature  of  their  chroma¬ 
tin  structure  is.  A  common  theme  among  these  and 
other  studies  of  PEV  is  that  variegation  is  extremely  sen¬ 
sitive  to  HP1  dosage  (Eissenberg  et  al.  1992;  Hwang  et  al. 
2001).  It  will  be  important  to  determine  if  this  HP1  dose 
sensitivity  holds  true  for  repression  of  a  mammalian  eu¬ 
chromatic  gene  and  if  it  can  influence  the  stable  compo¬ 
nent  demonstrated  in  our  system. 


The  interplay  between  histone  methylation  and 
DNA  methylation 

Our  comparative  ChIP  analysis  of  silent  and  active 
clones  clearly  shows  enrichment  of  KAP1,  SETDB1, 
HP1,  and  H3-MeK9  at  the  silenced  lucif erase  transgene 
promoter,  all  of  which  are  likely  to  contribute  to  the 
stably  silenced  phenotype.  However,  most  remarkable  is 
the  finding  of  increased  DNA  methylation  in  the  stably 
silenced  clone  also  centered  on  the  proximal  promoter 
region  bound  by  these  proteins.  That  this  DNA  methyl¬ 
ation  contributes  to  the  silencing  is  suggested  by  the 
potent  reactivation  of  the  silent  locus  by  the  combined 
action  of  5  AC  +  TSA.  This  observation  suggests  that  H3- 
MeK9  methylation  and  HP1  recruitment  on  a  euchro¬ 
matic  gene  may  ultimately  lead  to  DNA  methylation. 
Identifying  which  component  of  the  histone-directed  si¬ 
lencing  machinery  serves  as  the  signal  for  recruitment 
and/or  activation  of  the  DNA  methyltransferases  re¬ 
quired  for  carrying  out  the  DNA  modification  will  be 
critical  for  understanding  this  system.  Moreover,  once 
the  DNA  methylation  mark  is  established,  it  likely  plays 
an  active  role  in  maintaining  the  histone-directed  ma¬ 
chinery  at  the  silent  locus.  This  could  be  accomplished 
in  two  ways.  First,  the  MBD2/3  component  of  the  NuRD 
histone  deacetylase  complex  could  directly  bind  the 
methylated  DNA  and  maintain  HDAC  activity  at  the 
locus.  Second,  in  addition  to  the  catalytic  SET  domain, 
the  SETDB1  protein  also  encodes  a  CpG  DNA  methyl 
binding  domain,  which,  if  functional,  would  maintain 


the  H3-MeK9  activity  at  the  locus  (Schultz  et  al.  2002). 
That  this  may  occur  is  supported  by  our  preliminary  ob¬ 
servation  that  5AC  +  TSA  reactivation  is  accompanied 
by  rapid  loss  of  SETDB1  and  H3-MeK9  at  the  locus  as 
assessed  by  ChIP  assays  (K.  Ayyanathan  and  F.J. 
Rauscher,  unpubl.).  In  summary,  the  development  of  this 
tissue  culture  model  will  provide  a  valuable  system  to 
study  in  detail  the  sequence  of  events  and  the  coordi¬ 
nated  interplay  between  histone  methylation,  HP1  depo¬ 
sition,  and  DNA  methylation. 

Materials  and  methods 

Expression  and  reporter  plasmids 

The  construction  of  the  pcKRAB-PAX3  (pcKP)  plasmid  has  been 
described  (Ayyanathan  et  al.  2000).  The  pcKRAB-PAX3-HBD 
(KPHBD)  plasmid  was  constructed  by  fusing  the  estrogen  recep¬ 
tor  (ERHBD™)  in  frame  with  the  COOH  terminus  of  the  PAX3 
DNA  binding  domain  (DBD)  at  a  unique  EcoRI  site.  The  pcK- 
RAB(DV)-PAX3-HBD  (K(DV)PHBD)  plasmid  was  constructed 
by  subcloning  a  mutant  KRAB  (DV18  19AA)  domain  (Margolin  et 
al.  1994)  as  a  ffindlll/BamHI  fragment  into  pcKRAB-PAX3- 
HBD  plasmid,  thus  replacing  the  wild-type  KRAB  domain.  A 
similar  strategy  was  utilized  to  construct  the  pcSNAG-PAX3- 
HBD  (SPHBD),  pcPLZF ( POZ (-PAX3-HBD  (PPHBD),  pcEngrailed- 
PAX3-HBD  (EPHBD),  and  pcWTl-PAX3-HBD  (WPHBD)  ex¬ 
pression  plasmids.  The  pcVP16-PAX3  (VPDBD)  expression 
plasmid  was  constructed  by  inserting  the  VP16  acidic  activation 
domain  (residues  1-98)  using  a  similar  approach.  All  PCR-de- 
rived  nucleotide  sequences  and  the  appropriate  fusion  junctions 
were  confirmed  by  sequencing  both  DNA  strands.  A  Pvull  frag¬ 
ment  (zeocinR  cassette)  from  pcDNA3.1/Zeo  plasmid  (Invitro- 
gen)  was  cloned  into  a  unique  Pvull  site  in  the  CD19-TK-LUC 
plasmid  to  generate  the  CD19-TK-LUC-ZeoR  plasmid  (Ayya¬ 
nathan  et  al.  2000). 

Antibodies 

The  sources  of  antibodies  are:  a-PAX3  IgG  (Fredericks  et  al. 
2000),  affinity-purified  a-KAPl  (raised  against  amino  acids  20- 
418;  Schultz  et  al.  2001),  a-SETDBl  (Schultz  et  al.  2002),  a-H3- 
MeK9  (Abeam,  Inc.;  this  antibody  recognizes  di-methyl  Lys  9), 
a-NT2  (raised  against  amino  acids  323-345;  Tanaka  et  al.  2002), 
and  a-HPla  and  a-HPly  IgGs.  The  a-HPla  and  a-HPl-y  are 
mouse  monoclonal  antibodies,  which  were  produced  using 
6-HIS-tagged,  full-length  human  antigens.  These  reagents  do 
not  cross-react  with  the  other  human  or  mouse  HP1  orthologs 
and  will  be  described  elsewhere  (D.C.  Schultz,  M.S.  Lechner,  K. 
Ayyanathan,  and  F.J.  Rauscher  III,  unpubl.). 

Transient  transfections  and  luciferase  reporter  assays 

Protein  expression  was  confirmed  by  transient  transfection  of 
COS-1  cells  followed  by  immunoprecipitation  of  the  [35S]-L- 
methionine-labeled  extracts  with  a-PAX3  IgG  (Ryan  et  al.  1999; 
Ayyanathan  et  al.  2000).  4-OHT-dependent  repression  poten¬ 
tials  were  monitored  via  transient  transfection  as  previously 
described  (Ayyanathan  et  al.  2000).  Cells  were  treated  with  ve¬ 
hicle  (0.1%  ethanol)  or  500  nM  4-OHT  (Research  Biochemicals 
International).  Fold  repression  was  determined  as  the  ratio  of 
normalized  light  units  in  vehicle-treated  cells  versus  4-OHT 
treated  cells. 

For  4-OHT  washout  experiments,  cells  were  seeded  at 
0.5  x  104  in  60-mm  dishes.  Duplicate  dishes  of  cells  were  then 
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treated  with  either  500  nM  4-OHT  (+OHT)  or  0.1%  ethanol 
(-OHT)  for  the  indicated  time.  At  the  end  of  induction,  one  dish 
of  -OHT  and  +OHT-treated  cells  were  harvested  while  the  re¬ 
maining  dishes  were  subjected  to  three  washes  each  day.  On 
fourth  day,  the  cells  were  harvested  by  trypsinization  and  re¬ 
plated  into  new  dishes.  Washings  were  continued  for  an  addi¬ 
tional  4  d.  Cells  were  harvested  daily  and  normalized  luciferase 
activities  determined. 

Generation  and  characterization  of  stably  transfected 
cell  clones 

NIH3T3  cell  lines  containing  a  stably  expressed  chimeric  re¬ 
pressor  and  the  CD19-TK-LUC-ZeoR  reporter  plasmid  were 
generated  by  cotransfection  and  selection  in  growth  medium 
containing  500  pg/mL  G418  +  100  pg/mL  zeocin.  Colonies  were 
ring-cloned,  expanded,  and  tested  for  luciferase  activity.  The 
luciferase  activities  were  normalized  to  protein  concentration 
and  expressed  as  light  units/O.D.  at  A595.  Stable  expression  of 
the  chimeric  RD-PAX3-HBD  proteins  in  clones  was  tested  by 
immunoprecipitation  with  a-PAX3. 

Quantitative  RT-PCR  (Q-RT-PCR) 

Total  RNAs  were  isolated  using  TRIzol  Reagent  (Life  Technolo¬ 
gies)  and  oligo-dT  primed  first-strand  cDNAs  produced.  Lucif¬ 
erase,  neomycinR,  zeocinR,  NT2,  Collla2,  and  RXR(3  mRNAs 
were  amplified  by  PCR  for  the  number  of  cycles  indicated. 
The  respective  primer  pairs  were:  LUC2  (5'-CAAGGATAT 
GGGCTC AC-3 ' )  and  LUC3  (5'-GACCTTTCGGTACTTCG- 
3'),  NEOl  (5 ' -TC AGCGC AGGGGCGCCCGGTTCTTT-3 ' )  and 
NE02  (5'-ATCGACAAGACCGGCTTCCATCCGA-3'),  ZEOl 
(5 '-ATGGCCAAGTTGACC AG-3 ' )  and  ZE02  (5'-TCAGTCC 
TGCTCCTCG-3 ' ),  NT2-1  (5 ' -GCC AGGCTAG AAGGGAGG- 
3')  and  NT2-2  ( 5 ' -GGTGTCTGTTGAGGTTGG-3 ' ),  COL1  (5'- 
GGCCTC AGCCTAGC AGATGG-3 ' )  and  COL2  (5'-GGCTTA 
TGAAGTCTTGCTGG-3 ' ),  RXR1  ( 5 '  -GGCTCTGTGC AATC 
TGCGGG-3')  and  RXR2  (5'-GTCCACAGGCATCTCCTCAG 
GG-3').  Following  electrophoresis  and  Southern  blotting,  the 
images  were  quantified  using  ImageQuant  (Molecular  Dynam¬ 
ics). 

In  vivo  analysis  of  chromatin  structure 

Nuclei  were  prepared  essentially  as  described  (Mymryk  et  al. 
1997).  For  the  restriction  enzyme  accessibility  assays,  the  nuclei 
were  diluted  into  appropriate  1  x  restriction  enzyme  buffers  in  a 
500-pL  reaction  volume  and  digested  with  250  units  of  the  in¬ 
dicated  enzymes  for  10  min  at  room  temperature.  The  reaction 
was  terminated  by  adding  100  pg  of  proteinase-K  in  10  mM 
Tris-HCl  at  pH  8.0,  10  mM  EDTA,  10  mM  NaCl,  and  incubated 
at  37°C  for  14-16  h.  These  samples  were  phenolxhloroform 
extracted  and  ethanol  precipitated.  The  purified  DNA  was  di¬ 
gested  with  BamHI  to  completion  and  purified.  Equal  amounts 
of  the  DNA  samples  were  taken  for  reiterative,  primer  exten¬ 
sion  PCR  reactions  with  the  following  32P-labeled  primers: 
LUC1  ( 5 '  -TCC  AGGAACC  AGGGCGTATCTCT-3 ' )  or  ZE02  or 
NE02.  Primer  extension  products  were  extracted  with  phenol¬ 
xhloroform  and  ethanol  precipitated.  Dried  DNA  pellets  were 
dissolved  in  formamide  gel-loading  buffer  and  electrophoresed 
in  7M  Urea/5  %  acrylamide  gels  in  TBE  buffer.  The  gels  were 
fixed  in  10%  acetic  acid,  dried,  and  autoradiographed. 

ChIP 

Soluble,  sonicated  chromatin  was  prepared  essentially  as  de¬ 
scribed  (Schultz  et  al.  2002).  Chromatin  fractions  were  immu- 


noprecipitated  with  10  pg  of  indicated  antibodies  and  the  im¬ 
mune  complexes  were  recovered  using  protein-A  sepharose 
beads  and  processed  as  described  (Schultz  et  al.  2002).  The  im- 
munoprecipitated  and  input  DNA  samples  were  used  in  quan¬ 
titative  PCR  using  the  following  oligonucleotides:  LUC1, 
LUC2,  LUC3,  ZE02,  PBS3  (5 ' -GAATAC ACGGAATTGGA 
TCCG-3'),  PBS1  (5  '-GATCGATAATTCGAGCTACTG-3 ' ), 
PBS2  (5 '-GAGCTCGGTACCCGGGTCG-3 ' ),  TKP1  (5'-GCGC 
GGTCCC AGGTCC ACTT -3 ' ),  SVP1  (5'-CCAGTTCCGCCCA 
TTCTCCC-3'),  Cl  ( 5 ' -GGATGCTGCC ACGGCCTGAGG-3 ' ), 
C2  (5 ' -GGGTCTGCC AGGAGCCTGTGG-3 ' ),  C3  (5'-GGGTC 
GCTATCTATAGCTGG-3'),  C4  (5'-GTCCTTTCACACCAC 
GGCAG-3'),  C5  (5'-GGCCTCAGCCTAGCAGATGG-3'),  C6  (5'- 
GAGCACTC AGACCTTCCAGAGG-3 ' ),  C7  (5'-GGAACATCA 
GGTGGTGACGG-3 ' ),  C8  (5'-GCAGCCCATCCTTCCCTGC 
AGG-3').  The  PCR  products  were  resolved  on  1.5%  agarose  gels 
and  the  bands  were  quantified  using  IQMac  vl.2  software. 

FISH 

For  detection  of  the  luciferase  gene,  either  0.1%  ethanol  (-OHT) 
or  500  nM  4-OHT  (+OHT)  treated  KPHBD21  cells  were  fixed  in 
4%  paraformaldehyde,  permeabilized  in  0.2%  Triton  X-100,  and 
then  treated  with  RNase.  After  equilibration  in  2x  SSC,  cells 
were  dehydrated  in  an  ethanol  series  (70%,  80%,  and  100%). 
The  CD19-TK-LUC  plasmid  was  labeled  with  biotin- 16-dUTP 
by  nick-translation  and  dissolved  at  10  ng/pL  in  50%  for¬ 
mamide  in  2x  SSC  containing  10%  dextran  sulfate,  100  ng/pL 
salmon  sperm  DNA,  1  pg/pL  yeast  tRNA,  and  1.5  pg/pL  Cot-1 
DNA.  Both  the  probe  and  the  cells  were  simultaneously  heated 
at  91°C  for  4  min  to  denature  DNA  and  incubated  overnight  at 
37°C.  After  hybridization,  specimens  were  serially  washed  at 
37°C  with  50%  formamide  in  2x  SSC,  2x  SSC,  and  0.25x  SSC. 
Hybridized  probes  were  detected  with  FITC-streptavidin  (Vec¬ 
tor  Laboratories).  Finally,  cells  were  equilibrated  in  PBS,  stained 
for  DNA  with  either  DAPI  or  Hoechst  (2  ng/mL)  and  mounted 
in  Fluoromount  G  (Southern  Biotechnology  Associates,  Inc). 
Cells  were  analyzed  with  a  Leitz  Fluovert  inverted  microscope 
equipped  with  a  digital  camera.  Images  were  obtained  using 
QED  Imaging  software. 

Clonal  analysis  of  luciferase  gene  expression 

Three  independent  subclones  (KPHBD21-8,  KPHBD21-39,  and 
KPHBD21-49)  that  express  high-level  luciferase  activities  were 
treated  with  either  500  nM  4-OHT  (+OHT)  or  0.1%  ethanol 
(-OHT)  for  4  d  continuously  and  subjected  to  limiting  dilution 
cloning  in  96-well  plates.  Wells  containing  single  cells  were 
identified  and  the  cells  were  propagated  for  -40  doublings  in 
normal  growth  medium.  Whole-cell  lysates  were  prepared  from 
these  individual  progeny  and  normalized  luciferase  activities 
were  determined. 

DNA  methylation  analysis 

MSPCR:  Genomic  DNAs  were  extracted  from  the  silent  (cl  39- 
40)  and  active  (cl  39-45)  clones,  and  1  pg  of  each  was  treated 
with  sodium  bisulfite  to  convert  the  unmethylated  cytosines  to 
uracil  as  described  (Herman  et  al.  1996).  These  DNAs  were  PCR 
amplified  using  unmethylated  sense  (US;  5'-GTTGATTTGGG 
TATTGAGTTTGAG-3')  or  methylated  sense  (MS;  5'-GTCGA 
TTCGGGTATCGAGTTCG-3')  primers  (present  in  the  TK  pro¬ 
moter)  and  an  antisense  primer  (UMA1;  5'-GGTTTTATTTTT 
TAGAGGATAGAATGG-3';  present  near  the  start  of  luciferase 
cDNA).  The  PCR  products  were  electrophoresed  on  a  1.5%  aga¬ 
rose  gel  and  photographed. 
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Sodium  bisulfite-genomic  sequencing:  The  sodium  bisulfite- 
modified  DNAs  were  PCR  amplified  using  UMS1  (sense;  5'- 
GTTTTAGTGTTTTATGTTTTAGG-3 ' )  and  UMA1  (anti- 
sense)  primers  that  were  present  in  regions  free  of  any  CpG 
residues  (to  avoid  any  preferential  amplification).  The  PCR 
products  were  TA-cloned  into  the  pCR  II  vector.  At  least  18 
recombinant  clones  of  each  were  sequenced  in  both  directions. 
Results  are  presented  for  seven  representative  clones  in  each 
category. 
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Epithelial-mesenchymal  transition  (EMT)  is  an  important  mechanism  for  phenotypic  conversion  in  normal 
development  and  disease  states  such  as  tissue  fibrosis  and  metastasis.  While  this  conversion  of  epithelia  is 
under  tight  transcriptional  control,  few  of  the  key  transcriptional  proteins  are  known.  Fibroblasts  produced 
by  EMT  express  a  gene  encoding  fibroblast-specific  protein  1  (FSP1),  which  is  regulated  by  a  proximal  ex¬ 
acting  promoter  element  called  fibroblast  transcription  site-1  (FTS-1).  In  mass  spectrometry,  chromatin 
immunoprecipitation,  and  siRNA  studies,  we  used  FTS-1  as  a  unique  probe  for  mediators  of  EMT  and  identi¬ 
fied  a  complex  of  2  proteins,  CArG  box-binding  factor-A  (CBF-A)  and  KRAB-associated  protein  1  (KAP-1), 
that  bind  this  site.  Epithelial  cells  engineered  to  conditionally  express  recombinant  CBF-A  (rCBF-A)  activate  the 
transcription  of  FSP1  and  undergo  EMT.  The  FTS-1  response  element  also  exists  in  the  promoters  modulating 
a  broader  EMT  transcriptome,  including  Twist,  and  Snail,  as  well  as  E-cadherin,  \\-catenin,  ZO  1,  vimentin,  a  1(1) 
collagen,  and  a -smooth  muscle  actin,  and  the  induction  of  rCBF-A  appropriately  alters  their  expression  as  well. 
We  believe  formation  of  the  CBF-A/KAP-l/FTS-1  complex  is  sufficient  for  the  induction  of  FSP1  and  a  novel 
proximal  activator  of  EMT. 


Introduction 

The  mechanisms  governing  molecular  signals  for  epithelial- 
mesenchymal  transition  (EMT)  are  increasingly  more  complex 
(1,  2).  Fibrogenesis  during  wound  healing  or  following  organ 
inflammation  depends  on  the  formation  and  proliferation  of 
new  fibroblasts  by  EMT.  We  previously  described  a  gene  encod¬ 
ing  fibroblast-specific  protein  1  (FSP1)  that  activates  in  epithelia 
during  EMT  and  is  constitutively  and  selectively  present  in  newly 
formed  fibroblasts  thereafter;  FSP1,  also  known  as  S100A4  in  the 
cancer  literature,  is  an  intracellular  calcium-binding  protein  whose 
appearance  is  linked  to  EMT  (3-7),  tissue  fibrosis  (4,  8),  pulmonary 
vascular  disease  (9),  increased  tumor  cell  motility  and  invasive¬ 
ness  (10),  and  metastatic  tumor  development  (11-16).  FSP1  helps 
epithelia  transition  to  new  morphology  and  motility  based  on  its 
ability  to  influence  levels  of  intracellular  calcium  and  actin  disas¬ 
sembly  when  transfected  into  cultured  cells.  The  important  role  of 
FSP1  in  EMT  is  underscored  by  findings  that  induction  of  EMT 
in  vitro  by  epithelial  growth  factor  (EGF)  and  TGF-p  is  blocked  by 
antisense  oligomers  against  mRNA  encoding  FSP1  (6)  and  that  lev¬ 
els  of  mRNA  encoding  E-cadherin  are  inversely  correlated  with  FSP1 
expression  in  invasive  lines  of  squamous  cell  carcinoma  (17). 

The  transcriptional  control  of  EMT  diversifies  the  lineage 
specification  of  epithelia  during  development  (18),  lineage  com- 
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mitment  of  epithelia  in  tissue  fibrosis  (1),  and  the  stimulation  of 
tumor  metastasis  (2,  11,  19).  An  increasing  number  of  transcrip¬ 
tion  factors  appear  to  activate  EMT  in  various  settings,  including 
Snail  (20),  Twist  (1 1),  high  mobility  group  A2  (HMGA2)  (21),  Slug 
(16),  and  Ets-1  (22,  23).  However,  the  set  of  transcription  factors 
controlling  EMT  is  far  from  complete. 

In  an  earlier  report  using  promoter  deletion  constructs  for  FSP1, 
we  identified  a  novel  as-acting  element  in  the  FSP1  promoter  called 
fibroblast  transcription  site-1  (FTS-1),  which  activates  transfected 
reporters  in  fibroblasts  and  forms  specific  complexes  in  EMSAs 
with  nuclear  extracts  (24).  Using  both  EMSA  and  DNA  affinity 
chromatography  followed  by  mass  spectrometry  analysis  (25),  we 
identified  2  key  transcriptional  proteins  in  this  complex,  namely 
the  CArG  box-binding  factor-A  (CBF-A)  (26)  and  the  KRAB-asso¬ 
ciated  protein  1  (KAP-1)  (27).  These  proteins  form  a  ternary  com¬ 
plex  with  FTS- 1  to  engage  the  transcription  of  FSP1  and  play  a  key 
functional  role  in  forming  fibroblasts  by  EMT. 

Results 

Purification  of  the  FTS-l-hinding  complex.  In  our  original  report  of 
FTS-1  (24),  we  used  a  100-bp  oligonucleotide  from  the  proximal 
FSP1  promoter  to  identify  the  fibroblast-specific  complex  by  EMSA. 
To  minimize  the  background  while  ensuring  specificity  of  binding 
to  the  5-nucleotide  FTS-1  motif,  this  time  we  used  a  25-bp  sequence 
containing  the  core  pentanucleotide  (TTGAT)  with  adjacent  pro¬ 
moter  sequences.  Complexes  identified  by  EMSA  using  both  oligo¬ 
nucleotides  and  fibroblast  nuclear  extracts  had  a  similar  number 
and  pattern  of  distribution  when  run  in  parallel  (Figure  1A);  differ¬ 
ences  in  the  relative  mobility  of  the  bands  arose  from  the  different 
lengths  of  the  oligonucleotides.  The  4  complexes  were  competed 
to  varying  degree  by  challenge  with  20x  molar  excess  unlabeled 
probe.  The  specificity  of  the  complexes  formed  with  the  25-bp 
FTS-1  oligonucleotide  was  further  tested  by  competition  analysis 
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Figure  1 

Identification  and  purification  of  the  FTS-1  complex 
by  EMSA  and  DNA  affinity  chromatography.  (A) 
Similar  complexes  form  in  EMSA  with  fibroblast 
nuclear  protein  and  either  25-bp  or  1 00-bp  probes, 
both  containing  the  FTS-1  core  pentanucleotide. 
The  complexes  were  challenged  with  nonlabeled 
probes.  (B)  Specificity  of  the  FTS-1  complex 
established  by  EMSA  following  challenge  by  vari¬ 
ous  nonlabeled  competitors.  Fibroblast  nuclear 
protein  was  incubated  with  the  25-bp  radiolabeled 
FTS-1  or  its  mutant  form  (m).  The  complexes  were 
challenged  with  DNA  probes  specific  for  differ¬ 
ent  regulatory  sites.  The  complex  specific  to  the 
TTGAT  pentanucleotide  (marked  1)  is  at  the  top; 
(m)  indicates  the  mutated  oligonucleotides.  (C) 
The  FTS-1  complex  (marked  1 )  is  formed  by  MCT 
cells  induced  to  EMT  as  evidenced  by  EMSA  at 
different  times  after  induction.  A  fibroblast-derived 
FTS-1  complex  is  shown  as  a  positive  control.  (D) 
DNA  affinity-purified  proteins  form  only  1  complex 
with  FTS-1  (marked  1)  in  EMSA.  Fractions  purified 
by  the  direct  or  indirect  approach  were  eluted  with 
0.5  M  KCi  followed  by  0.5  M  KCI  plus  5  M  urea, 
dialyzed  against  0.1  M  KCI,  and  incubated  with 
radiolabeled  FTS-1  or  its  mutated  form  (m).  The 
flow-through  (FT)  from  the  direct  incubation  could 
not  form  the  FTS-1  complex. 


using  related  and  unrelated  probes  (Figure  IB).  Complexes  formed 
with  radiolabeled  FTS-1  or  its  mutant  (CCAGC  instead  ofTTGAT) 
were  competed  at  20x  molar  excess  with  unlabeled  species  or  with 
consensus  sites  for  the  Y-box  transcription  factor  (YB-1)  and  serum 
response  factor  (SRE).  The  YB-1  core  sequence  differs  in  1  nucleo¬ 
tide  from  FTS-1  (TTGGT  versus  TTGAT),  and  SRE  is  unrelated  to 
FTS-1  and  contains  a  CarG  box  DNA  motif.  Mutant  constructs  of 
these  oligonucleotides  were  also  used  in  the  competition  analysis. 
Of  the  4  complexes  formed  by  the  FTS-1  probe,  the  more  slowly 
migrating  complex  1  was  not  competed  by  its  mutant  and  the  only 
one  not  formed  by  a  radiolabeled  mutant.  Complex  1  was  competed 
by  YB- 1  because  of  structural  similarity,  but  when  used  as  a  probe, 
YB-1  did  not  form  a  similar  complex  in  EMSA  (data  not  shown). 
Complex  1  was  also  unaffected  by  challenge  with  the  YB- 1  mutant 
or  SRE.  Competition  analysis  confirmed  this  complex  as  specific 
for  the  core  FTS-1  site  (TTGAT).  Unlabeled  competitors  did  not 
affect  complex  4.  Complexes  2  and  3,  although  not  competed  by 
YB-1  or  by  SRE,  were  formed  with  both  FTS-1  and  its  mutant  (Fig¬ 
ure  IB).  Therefore,  only  complex  1  was  specific  for  FTS-1.  Complex 
1  also  formed  in  EMSA  when  the  probe  was  incubated  with  nuclear 
extracts  from  kidney  epithelial  cells  undergoing  EMT  (Figure  1C) 
in  parallel  with  the  expression  of  FSP1  (6). 

These  results  were  confirmed  in  another  epithelial  cell  line  from 
mouse  inner  medullary  collecting  duct  (mlMCD;  ref.  28).  mlMCD 
cells  stimulated  to  undergo  EMT  with  TGF-|3/EGF  formed  FTS-1 
complexes  on  day  5  (data  not  shown). 

The  FTS-l-binding  complex  identified  by  EMSA  was  purified 
independently  using  DNA  affinity  chromatography.  We  used  a 
biotinylated  tetramer  of  the  25-bp  FTS-1  sequences  attached  to 


streptavidin-coated  magnetic  beads.  The  binding  reactions  were 
performed  under  the  same  conditions  as  the  EMSAs  to  ensure 
comparability.  Initially  we  used  2  experimental  variants:  a  direct 
incubation  of  nuclear  extracts  with  the  FTS-1  oligonucleotide;  and 
an  indirect  approach  using  the  flow-through  from  an  initial  incu¬ 
bation  with  the  mutant  tetramer  for  a  secondary  incubation  with 
the  TTGAT  tetramer.  Proteins  that  bind  specifically  were  eluted 
with  high  salt,  with  or  without  urea,  and  analyzed  by  EMSA  using 
radiolabeled  mutant  or  wild-type  FTS-1  (Figure  ID).  The  results 
of  this  analysis  demonstrate  that  affinity-purified  proteins  were 
able  to  form  a  DNA-protein  complex,  which  comigrated  with  com¬ 
plex  1.  There  was  no  difference  in  the  complexes  formed  by  the 
proteins  purified  by  either  approach,  and  subsequently  we  used 
only  the  indirect  approach  to  minimize  contaminating  proteins. 
These  results  established  that  both  EMSA  and  DNA  affinity  chro¬ 
matography  consistently  identified  the  same  FTS-1  complex.  The 
complex  recovered  both  from  EMSA  and  from  DNA  affinity  puri¬ 
fication  was  used  separately  for  protein  identification  utilizing 
tandem  mass  spectrometry  (25). 

Proteomic  analysis  of  FTS-1  complexes.  The  combination  of  EMSA  with 
mass  spectrometry  identified  several  protein  signatures.  Two  inde¬ 
pendent  peptides  from  the  FTS-1  complex  (Table  1)  were  identified 
as  CBF-A  (26).  Two  other  peptides  belonged  to  the  tripartite  motif 
protein  28  (TIFip/Trim28),  a  mouse  homolog  of  KAP-1  (27).  Other 
identified  peptides  were  from  the  eukaryotic  elongation  factor  1  al 
(EEF1A1)  and  the  DNA  replication-licensing  factor  MCM3. 

A  second  technique,  DNA  affinity  chromatography  followed  by 
mass  spectrometry,  also  identified  CBF-A  and  KAP-1  in  the  FTS-1 
complex.  Three  independent  CBF-A  peptides  were  identified; 
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Table  1 

Peptide  identification  in  the  FTS-1-binding  complex 


Protein 

Accession  no. 

Peptide  sequence 

Purification  by  EMSA  Purification  by  DNA  affinity  chromatography 

CBF-A 

Q99020 

GFVF1TFKEEDPVK,  IFVGGLNPEATEEK  GFVFITFKEEDPVK,  IFVGGLNPEATEEK, 

EVYQQQQYGSGGR 

Tripartite  motif  protein  28 

Q62318 

EEDGSLSLDGADSTGVVAK, 

EEDGSLSLDGADSTGVVAK, 

(TIF1  |3/Trim28)/KAP-1 

LDLDLTSDSQPPVFK 

LDLDLTSDSQPPVFK 

EEF1A1  (EF-1  ctl/EF-Tu) 

PI  01 26 

EHALLAYTLGVK,  IGGIGTVPVGR,  RYEEIVK, 
STTTGHLIYK,  THIIMIVVIGHVDSGK,  YYVTIIDAPGHR 

None 

2  are  shared  by  both  purification  approaches,  and  a  third  one, 
EVYQQQQYGSGGR,  appeared  in  the  affinity-purified  complex 
(Table  1).  The  2  KAP-1  peptides  described  above  were  also  con¬ 
firmed.  However,  neither  EEF1A1  nor  MCM3  peptides  were  pres¬ 
ent  in  the  affinity-purified  complex. 

We  used  2-dimensional  EMSA/SDS-PAGE  followed  by  immu- 
noblotting  to  further  characterize  the  composition  of  the  FTS-1 
complex.  To  this  end,  we  ran  nuclear  extracts  in  EMSA  with  and 
without  incubation  with  an  FTS-1  probe  and  then  subjected  the 
gel  pieces  to  second-dimension  SDS-PAGE.  After  subsequent 
transfer  to  membrane  and  immunostaining,  we  confirmed  the 
presence  of  CBF-A  and  KAP-1  in  the  FTS-1  complex  (Figure  2A). 
Immunostaining  indicated  the  absence  of  CBF-A  and  KAP-1  in 
control  extracts,  as  shown  on  the  left  side  of  each  panel.  Only 
MCM3-positive  immunostaining  was  detected  on  the  left  side 
of  the  panel  (data  not  shown),  which  suggests  this  latter  protein 
does  not  form  a  complex  with  FTS- 1  and  only  comigrates  with  the 
complex  in  the  first  dimension.  An  antibody  against  EEF1A1  did 
not  detected  it  either  in  the  complex  or  as  a  comigrating  protein, 
suggesting  it  was  an  artifact. 

The  binding  of  CBF-A  and  KAP-1  to  the  FTS-1  probe  was  con¬ 
firmed  additionally  by  EMSA  supershifts  (Figure  2B).  The  CBF-A 
antibody  supershifts  the  FTS-1  complex,  while  antibody  against 
KAP-1  abolishes  it.  We  suspect  the  KAP-1  antibody  most  likely 
inhibits  sterically  the  interaction  of  CBF-A/KAP-1  with  FTS-1. 
Anti-rabbit  IgG  did  not  affect  the  mobility  or  integrity  of  the  FTS-1 
complex.  Finally,  we  performed  a  chromatin  immunoprecipitation 
(ChIP)  assay  with  the  CBF-A  antibody  using  a  rabbit  preimmune 
serum  as  a  negative  control.  Western  blots  of  precipitates  (Figure 
2C)  confirmed  the  presence  of  both  CBF-A  and  KAP-1  as  partici¬ 
pating  in  a  complex  with  FTS-1.  The  increased  expression  of  CBF-A 
and  KAP-1  in  cell  nuclei  during  EMT  was  further  demonstrated 
by  immunostaining  MCT  epithelia  stimulated  with  TGF-(5  and 
EGF  in  culture  (6);  the  nuclear  staining  for  CBF-A  increased  from 
14.8%  of  cells  in  unstimulated  epithelia  to  47.9%  in  EMT-induced 
fibroblasts,  and  nuclear  KAP-1  levels  increased  from  3.3%  to  30.2% 
(Figure  2D).  These  data  collectively  suggest  that  CBF-A  and  KAP-1 
are  components  that  recognize  the  FTS-1  element  in  cell  nuclei. 

rCBF-A  induces  de  novo  expression  ofFSPl  in  kidney  epithelial  cells. 
Differentiated  epithelial  cells  do  not  express  FSP1  nor  form  nucle¬ 
ar  complexes  with  FTS-1  (24).  When  they  undergo  EMT,  however, 
they  form  FTS-1  complexes  (Figure  1C)  and  synthesize  FSP1  (6). 
To  understand  the  functional  significance  of  CBF-A  in  the  induc¬ 
tion  of  FSPlj  we  expressed  recombinant  CBF-A  (rCBF-A)  in  epi¬ 
thelial  cells.  For  this  purpose,  the  kidney  epithelial  cell  line  MCT 
was  stably  transfected  with  a  TET-inducible  vector  for  conditional 


expression  of  rCBF-A  (MCT/rCBF-A).  We  selected  several  clones 
that  displayed  controlled  overexpression  of  rCBF-A  following 
exposure  to  doxycycline.  We  refer  to  the  noninduced  state  as  MCT/ 
rCBF-A~  and  to  the  doxycycline-induced  state  as  MCT/rCBF-A+. 
The  induction  of  rCBF-A  with  doxycycline  led  to  the  expression 
of  FSP1  transcripts  (Figure  3A)  and  the  coordinate  appearance  of 
FSP1  protein  (Figure  3D).  Overexpression  of  rCBF-A  resulted  in 
the  formation  of  the  FTS-1  complex,  as  confirmed  by  ChIP  assay 
with  CBF-A  and  KAP-1  antibodies  (Figure  3B).  Noninduced  cells 
maintain  low  levels  of  endogenous  CBF-A,  and  the  FTS-1  DNA 
site  could  not  be  detected  by  PCR.  However,  cells  induced  with 
doxycycline  formed  FTS-1  complexes  robustly  (Figure  3B).  Chro¬ 
matin  from  3T3  fibroblasts  was  immunoprecipitated  in  parallel  as 
a  positive  control  (Figure  3B,  left),  with  comparable  results.  Fur¬ 
thermore,  nuclear  extracts  from  MCT/rCBF-A+  cells,  but  not  from 
wild- type  MCT  cells,  formed  the  FTS-1  complex  in  EMSA  with 
affinity  similar  to  that  of  extracts  from  3T3  fibroblasts,  as  demon¬ 
strated  by  challenge  with  increasing  molar  amounts  of  unlabeled 
competitor  (Figure  3C).  These  results  indicate  that  overexpression 
of  rCBF-A  leads  to  formation  of  the  FTS-1  complex  and  activation 
of  FSP1  expression  in  epithelia  in  much  the  same  way  as  FSP1  is 
expressed  by  fibroblasts. 

We  further  confirmed  these  findings  in  an  in  vivo  kidney  model 
of  EMT  and  fibrosis  following  unilateral  ureteral  obstruction 
(UUO)  (4).  Obstructed  kidneys  undergoing  EMT  and  fibrosis 
after  12  days  (data  not  shown)  had  increased  levels  of  CBF-A, 
FSP1,  and  a-SMA  (P  <  0.002;  Figure  3E).  This  increase  in  CBF-A 
and  FSP1  was  accompanied  by  an  expected  decrease  in  E-cadherin 
expression.  The  expression  of  the  other  protein  in  the  FTS-1  com¬ 
plex,  KAP-1,  was  slightly  upregulated  in  the  UUO  kidney,  but  the 
increase  was  not  statistically  significant. 

Figure  3F  demonstrates  a  change  in  the  nuclear  localization 
of  CBF-A  in  fibrotic  renal  tissue  12  days  after  UUO  in  FSP1.GFP 
mice.  CBF-A  in  the  normal  contralateral  kidney  was  observed  in 
the  cytoplasm  (speckled  green)  of  tubular  cells  (nuclei  counter- 
stained  red  with  propidium  iodide);  FSP1+  fibroblasts  expressing 
GFP  under  the  control  of  the  FSP1  promoter  were  recolored  dark 
blue  in  these  images.  In  the  frames  labeled  UUOl  and  UU02,  the 
red  nuclei  have  become  yellow/orange  with  a  change  in  localiza¬ 
tion  of  CBF-A  from  the  cytoplasm  (percentage  of  nuclei  stained  for 
CBF-A  increased  21%  in  obstructed  kidneys  compared  with  4%  in 
contralateral  controls;  P  <  0.000 1),  and  the  fibroblasts  now  stained 
light  blue  with  the  addition  of  CBF-A.  This  series  of  images  sug¬ 
gests  that  the  nuclei  of  many  tubular  cells  in  the  UUO  kidney  are 
enriched  for  CBF-A  and  potentially  positioned  for  an  EMT  event. 
The  upregulation  of  CBF-A  in  UUO  kidney  established  by  quanti- 
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Figure  2 

CBF-A  and  KAP-1  are  integral  parts  of  the  FTS-1  complex.  (A)  Immunoblotting  and  2D  elec¬ 
trophoresis.  First  dimension:  EMSA;  second  dimension:  SDS-PAGE.  Left:  control  for  EMSA 
with  nuclear  protein  without  the  probe;  right  (FTS-1  complex):  protein  with  the  FTS-1  probe. 
The  absence  of  immunostaining  on  the  left  indicates  that  the  investigated  proteins  do  not 
comigrate  with  the  complexes  in  the  absence  of  DNA.  (B)  The  participation  of  CBF-A  and 
KAP-1  in  the  FTS-1  complex  was  confirmed  by  EMSA  with  their  antibodies.  The  complex  (*) 
was  supershifted  by  the  CBF-A  antibody  as  indicated  (**)  and  was  abolished  by  anti-KAP-1 
antibody;  rabbit  IgG  did  not  affect  the  complex.  (C)  ChIPof  3T3  chromatin  with  the  anti-CBF-A 
antibody  or  rabbit  preimmune  serum  (-).  Western  blotting  (upper  panel):  the  immunoblots 
were  probed  with  antibodies  against  KAP-1  and  CBF-A;  the  stained  IgG  is  shown  as  a 
loading  control.  PCR  (lower  panel):  Increasing  amounts  (2,  5,  and  10  yl)  of  DNA  eluted 
from  the  immunoprecipitates  were  used  as  template  in  PCR  with  primers  encompassing 
the  FTS-1  site  in  both  directions,  producing  a  fragment  of  approximately  280  bp.  Genomic 
DNA  from  3T3  fibroblasts  was  used  in  the  PCR  as  a  positive  control.  M,  DNA  size  ladder 
in  kilobases  (kb).  (D)  After  induction  of  EMT,  MCT  cells  showed  increased  nuclear  colocal¬ 
ization  of  KAP-1  and  CBF-A.  MCT  epithelia  were  stimulated  for  5  days  with  TGF-p/EGF 
in  culture.  Cells  were  stained  with  nuclear  propidium  iodide  dye  (red)  followed  by  indirect 
immunofluorescence  with  primary  antibodies  against  CBF-A  or  KAP-1  or  with  control  FITC- 
anti-rabbit  IgG  alone.  Original  magnification,  x630. 


a-SMA,  vimentin,  and  a  1(1)  collagen,  togeth¬ 
er  with  the  newly  identified  markers  of  EMT, 
N-cadherin,  Snail,  and  Twist  (Figure  3D).  The 
latter  2  proteins  are  transcription  factors  asso¬ 
ciated  with  increased  motility  of  transitional 
epithelial  cells  (11,  20).  The  levels  of  E-cad- 
herin,  [3-catenin,  and  ZO-1  were  attenuated 
correspondingly.  These  changes  in  protein 
levels  are  indicative  of  cell  remodeling  associ¬ 
ated  with  EMT  (1,2). 

The  differential  expression  of  epithelial 
and  fibroblast  markers  induced  by  rCBF-A 
was  confirmed  by  changes  in  levels  of  mRNA 
(data  not  shown)  and  by  immunostaining 
of  MCT/CBF-A+  compared  with  control  epi¬ 
thelial  cells  5  days  after  induction  (Figure 
4A).  The  pattern  of  expression  and  distribu¬ 
tion  of  ZO-1,  (3-catenin,  a-SMA,  and  vimen¬ 
tin  in  MCT/CBF-A+  cells  clearly  indicates  a 
change  of  phenotype  associated  with  FSP1 
expression.  Furthermore,  5  days  following 
the  induction  of  rCBF-A,  MCT/rCBF-A+  cells 
but  not  control  cells  assumed  a  spindle-like, 
fibroblast  morphology  typical  of  epithelial 
transition  (Figure  4B). 

The  change  in  phenotype  induced  by 
rCBF-A  was  associated  with  an  increased  abil¬ 
ity  of  MCT/rCBF-A+  fibroblasts  to  migrate 
directionally  following  serum  stimulation 
in  a  Boyden  chamber  assay  (Figure  4C).  Con¬ 
versely,  inhibition  of  CBF-A  expression  in  3T3 
fibroblasts  by  siRNA  caused  a  coordinated 
downregulation  of  CBF-A  and  FSP1  proteins 
(Figure  5).  Transiently  transfected  siRNAl 
best  inhibited  CBF-A  and  FSP1  expression 
after  48  hours  by  qRT-PCR  from  3  indepen¬ 
dent  experiments;  37%  and  49%,  respectively 
( P  <  0.001).  These  data  collectively  indicate 
that  overexpression  of  rCBF-A  in  epithelial 
cells  leads  to  formation  of  the  FTS-1  com¬ 
plex,  expression  of  FSP1,  modulation  of  epi¬ 
thelial  and  fibroblast  genes  including  Snail 
and  Twist,  and  subsequent  EMT. 


tative  real-time  PCR  (qRT-PCR)  analysis  (Figure  3E)  suggests  that 
most  of  the  nuclear  protein  is  newly  synthesized. 

Overexpression  of  CBF-A  engages  the  fibroblast  transcriptome  by  initiating 
EMT.  Treatment  of  cultured  epithelia  with  a  combination  of  EGF 
and  TGF-[3  initiates  EMT  (6),  which  is  associated  with  the  activa¬ 
tion  of  fibroblast  markers  such  as  FSP1,  vimentin,  and  a-SMA  and 
a  downregulation  of  E-cadherin  and  the  tight  junction-associated 
protein  zona  occludens  1  (ZO-1);  the  induction  of  FSP1  precedes 
the  emergence  of  the  EMT  phenotype  and  the  capacity  of  epithe- 
lia  for  movement,  indicating  a  key  early  role  for  FSP1.  A  similar 
induction  of  FSP1  in  MCT/rCBF-A+  cells  (Figure  3D)  suggests  that 
rCBF-A  is  sufficient  to  initiate  EMT.  We  further  checked  for  levels 
of  expression  of  a  number  of  other  epithelial  and  fibroblast  cell 
markers  in  MCT/rCBF-A+  cells,  as  well  as  their  cell  motility  and 
morphology.  Results  from  immunoblotting  indicate  that  MCT/ 
rCBF-A+  cells  have  increased  levels  of  the  fibroblast  markers  FSP1, 


Discussion 

In  this  study,  we  have  discovered  what  we  believe  to  be  a  new  set 
of  transcriptional  proteins  mediating  early  events  in  EMT.  As  an 
experimental  probe,  we  used  a  novel  promoter  element  (FTS-1) 
found  in  the  gene  encoding  FSP1  to  identify  a  novel  complex  of 
transcriptional  proteins,  CBF-A  and  KAP-1,  which  engage  the 
genes  encoding  the  EMT  proteome.  We  conclude  the  following: 
first,  CBF-A  and  KAP-1  can  be  found  in  a  complex  with  the  FTS-1 
element  in  fibroblasts  and  in  epithelia  and  tissues  undergoing 
EMT;  second,  occupancy  of  the  FTS-1  site  by  these  proteins  in  the 
chromatin  of  transitioning  epithelia  correlates  with  the  activation 
of  the  EMT  proteome;  and  third,  the  expression  of  recombinant 
CBF-A  initiates  key,  early  transcriptional  and  phenotypic  events  in 
EMT  (see  schematic  in  Figure  6). 

The  first  component  we  found  bound  to  FTS-1  was  originally 
cloned  as  the  CarG  box-binding  protein  called  CBF-A  (26)  —  not 
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Figure  3 

Overexpression  of  CBF-A  in  the  epithelial  cells  leads  to  formation  of  the  FTS-1  complex  and 
expression  of  fibroblast  markers.  (A)  Transcripts  for  rCBF-A  and  FSP1  increase  in  PCR  cycles 
after  rCBF-A  induction.  (B)  ChIP  assay  shows  formation  of  the  FTS-1  complex  in  epithelial 
cells  expressing  rCBF-A  (+Dox;  doxycycline)  compared  to  3T3  cells.  IgG  was  a  loading  control. 
Immunoprecipitated  DNA  was  used  in  PCR  with  primers  for  FTS-1.  (C)  Nuclear  protein  from 
MCT/rCBF-A+  or  3T3  fibroblasts  form  FTS-1  complexes  in  EMSA  with  similar  affinity,  chal¬ 
lenged  by  10x  or  20x  molar  excess  of  nonlabeled  FTS-1  probe  (Cold  probe).  MCT  cells  were 
the  negative  control.  (D)  Differential  expression  of  epithelial  and  mesenchymal  protein  markers 
in  MCT/rCBF-A+  cells  at  different  hours  after  induction  of  rCBF-A  compared  with  3T3  fibroblasts. 
Coo,  Coomassie  staining.  (E)  Increased  expression  of  CBF-A  and  FSP1  in  UUO  undergoing 
EMT  (28%  and  26%,  respectively)  as  measured  by  qRT-PCR  and  represented  graphically  in 
arbitrary  units.  Included  for  comparison  are  E-cadherin,  a-SMA,  and  KAP-1.  (F)  CBF-A  (yel¬ 
low/orange)  accumulates  in  the  nuclei  of  tubular  and  interstitial  cells  from  the  UUO  kidney  in 
FSP1  .GFP  mice.  FSP1+  fibroblasts  expressing  GFP  under  the  control  of  the  FSP1  promoter 
were  recolored  dark  blue,  and  nuclei  were  counterstained  red.  Normal  contralateral  kidney  tis¬ 
sue  demonstrates  CBF-A  in  the  tubular  cytoplasm  (speckled  green).  In  the  cytoplasm,  CBF-A 
and  FSP1  staining  merge  to  become  light  blue.  Original  magnification,  x630. 


to  be  confused  with  other  proteins  named  similarly:  the  CBF-bind- 
ing  intronic  enhancer  (29),  Cbfal/osteoblast-specific  factor  2  (30), 
the  core  binding  factors  (CBFs)  (31),  PEBP2/CBFA1  (30),  CCAAT- 
binding  factor  a  (CBF-A)  (32),  and  the  hematopoietic  transcrip¬ 
tion  factor  CBFA2  (33). 

Multiple  nucleic  acid  binding  activities  have  been  attributed  to 
CBF-A,  although  its  ability  to  bind  a  DNA  duplex  with  sequence 
homology  to  FTS-1  has  not  been  previously  demonstrated.  As 
a  CArG  box-binding  protein,  it  was  originally  described  as  a 
transcriptional  repressor  of  the  a SMA  promoter  (26).  There  is 


no  sequence  similarity  between  FTS-1 
and  the  canonical  CarG  box  sequence 
(CC[AT]6GG),  and  moreover,  the  SRE  con¬ 
taining  a  canonical  CArG  box  element  does 
not  compete  the  FTS-1  complex.  CBF-A 
has  also  been  described  both  as  condition¬ 
al  repressor  (34)  or  activator  of  the  Ha-ras 
response  element  (35)  and  as  an  activator 
of  the  rat  spi  2  gene  (36)  via  a  GAGA  box. 
CBF-A  also  binds  an  AT-rich  promoter 
element  in  immunoglobulin  kappa  genes 
(37)  and  stabilizes  a  specific  tetraplex 
telomeric  DNA  sequence  (38).  Thus,  the 
overwhelming  evidence  implicates  CBF-A 
as  a  transcriptional  regulator  and/or  struc¬ 
tural  component  of  chromatin.  However, 
remarkably,  CBF-A  contains  no  recogniz¬ 
able  DNA  binding  motif  for  FTS-1. 

CBF-A  also  belongs  to  the  heterogeneous 
nuclear  ribonucleoprotein  (Hnrpab)  sub¬ 
family  A/B  (hnRNP  A/B).  It  contains  2 
evolutionarily  conserved  RNA  recognition 
motifs  found  in  many  RNA-binding  pro¬ 
teins,  functions  as  an  mRNA-editing  pro¬ 
tein  (39),  and  mediates  the  stability  of  pro- 
inflammatory  mRNAs  by  binding  to  the 
AU-rich  element  (40).  As  a  complex  with 
actin,  it  is  involved  in  nuclear-cytoplasmic 
transport  of  mRNA  (41).  Androgen-regu¬ 
lated  posttranslational  modification  of 
CBF-A  is  also  associated  with  cytoskeletal 
changes  in  a  mouse  fetal  cell  line,  which 
may  have  been  unrecognized  EMT  (42). 
Finally,  CBF-A  is  upregulated  in  some 
mammary  tumors  (43)  and  in  solid  tumor 
metastasis  (44).  Depending  on  the  con¬ 
text,  CBF-A  appears  to  be  a  “moonlight¬ 
ing”  protein  (45).  It  activates  or  represses 
transcriptional  and  posttranscriptional 
events,  with  a  particularly  important  role 
in  EMT.  Finally,  by  inference,  the  associa¬ 
tion  of  EMT  with  metastatic  conversion 
(13)  together  with  reports  of  elevated  levels 
of  CBF-A  in  metastasis  (43,  44)  and  its  role 
in  the  transactivation  of  the  Ha-ras  proto¬ 
oncogene  (35)  suggests  a  possible  role  in 
carcinogenesis.  These  properties  are  com¬ 
pletely  consistent  within  our  identification 
of  CBF-A  as  a  regular  of  EMT  (13). 

The  second  protein  identified  in  the  FTS-1 
complex,  KAP-1,  is  a  well-studied  transcriptional  repressor  that 
binds  to  KRAB  domains  on  zinc  finger  proteins  (27).  The  transcrip¬ 
tional  repressor  function  of  KAP-1  is  mediated  through  its  interac¬ 
tion  with  the  heterochromatin-associated  protein  HP1  (46, 47)  and 
with  a  specific  histone  methyltransferase,  SETDB1,  that  contrib¬ 
utes  to  the  silencing  of  actively  transcribed  genes  in  euchromatin 
(48, 49).  It  is  surprising,  but  intriguing,  that  the  KAP-1  corepressor 
is  present  in  our  nucleoprotein  complex  that  functions  to  activate 
transcription.  Moreover,  previous  studies  of  KAP-1  have  shown  it 
to  be  almost  exclusively  bound  to  KRAB-zinc  finger  proteins  in 
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Figure  4 

Inducible  expression  of  rCBF-A  in  MCT  epithelial  cells  leads  to  induction  of  FSP1 ,  changes  in  the  expression  pattern  of  cell  type  markers,  and 
acquisition  of  mesenchymal  phenotype.  (A)  Immunostaining  with  cell  type  markers  of  naive  epithelium  (MCT)  and  of  cells  expressing  rCBF-A. 
Results  of  staining  with  isotype  IgG  are  included  (Control).  Original  magnification,  x630.  (B)  Morphological  changes  in  MCT/rCBF-A+ compared 
with  MCT  cells  at  day  5  of  rCBF-A  induction.  Cells  are  shown  at  both  high  and  low  density.  Original  magnification,  x200.  (C)  Increased  motility  of 
cells  expressing  rCBF-A  (rCBF-A+)  after  doxycycline  induction  as  compared  with  rCBFA"  and  control  MCT  cells.  3T3  fibroblasts  were  included 
in  this  Boyden  chamber  assay  as  positive  control.  The  average  number  of  migrating  cells  is  represented  graphically  and  evaluated  statistically. 


vivo.  However,  our  ChIP  data  conclusively  show  that  both  CBF-A 
and  KAP-1  are  present  at  the  endogenous  CBF-A  site.  There  are  a 
number  of  possibilities  that  may  reconcile  this  new  finding:  first, 
there  is  evidence  that  the  RNA  recognition  motif  (RRM)  in  CBF-A 
may  interact  with  the  PHD  domain  of  KAP-1  (50, 51).  Thus,  CBF-A 
and  KAP-1  may  form  a  physical  complex  that  converts  KAP-1 
into  a  coactivator.  We  have  recent  evidence  that  posttranslational 
modification  by  SUMO  regulates  the  repressor  activity  of  KAP-1, 
thus  providing  a  mechanism  for  toggling  between  activation  and 
repression  (unpublished  observations).  Second,  and  alternatively, 
an  as-yet-unidentified  KRAB-zinc  finger  protein  may  bind  FTS-1 
and  tether  KAP-1  to  the  site.  Subsequently,  the  KAP-1  protein  may 
remain  associated  with  the  chromatin  after  release  of  the  putative 
KRAB-zinc  finger  protein  during  EMT  activation.  Third,  CBF-A 
may  be  bound  to  a  protein  such  as  nucleophosmin,  as  has  been 
recently  described  for  the  immunoglobulin  kappa  promoter  (52). 
These  findings  open  the  possibility  that  the  promoter-binding 
effect  of  CBF-A  can  be  regulated  by  its  interaction  with  various 
protein  partners.  The  interaction  of  CBF-A  with  KAP1  may  confer 
such  specificity  for  binding  the  FTS-1  promoter  element  in  epithe- 
lia  transitioning  to  fibroblasts.  How  KAP- 1  modulates  this  interac¬ 
tion  is  not  yet  known. 

A  key  finding  of  this  study  is  that  rCBF-A- triggers  expression  of 
the  EMT  proteome  in  epithelia  that  parallels  the  changes  in  pheno¬ 


type  and  potential  for  motility,  all  characteristics  compatible  with  a 
fibroblast  phenotype.  Several  other  transcription  proteins  are  also 
expressed  during  EMT.  Twist  (11),  HMGA2  (21),  and  Snail  (53) 
repress  E-cadherin  and  induce  EMT  in  epithelia,  although  loss  of 
E-cadherin  alone  is  not  sufficient  to  initiate  EMT  (54).  A  recent 
report  also  suggests  that  gelsolin  functions  as  a  switch  that  con¬ 
trols  conversion  of  E-cadherin  to  N-cadherin  via  Snail,  and  gelso- 
lin’s  inhibition  triggers  EMT  (55).  Ets-1  (56)  and  Rho-related  pro¬ 
teins  (57)  are  also  involved  in  this  process.  The  fact  that  different 
transcriptional  regulators  can  independently  induce  EMT  suggests 
multifunctionality  to  molecular  activation  that  may  depend  on 
the  context  of  cytokine  signaling  events  or  changes  in  cytoskeletal 
structure  that  force  cells  into  transition  (1).  Since  metastatic  tumor 
cells  employ  the  same  molecular  program  for  EMT  as  that  used  by 
epithelia  converting  to  fibroblasts  (13),  the  FTS-1  complex  is  likely 
used  by  different  epithelia  to  ensure  phenotypic  transitions. 

Several  potential  binding  sites  for  transcription  factors  have 
been  identified  in  genes  implicated  in  the  EMT  transcriptome, 
and  our  in  silico  search  of  the  promoter  regions  of  these  genes 
upstream  of  transcriptional  start  site  are  summarized  in  Table  2. 
The  E-box  motifs  are  present  only  in  2  genes  repressed  in  EMT, 
E-cadherin  and  gelsolin,  but  not  in  ZO-1  or  (5-catenin,  where 
expression  is  equally  attenuated.  The  CArG  box  motif  is  pres¬ 
ent  in  the  repressed  gene  encoding  E-cadherin  but  is  also  present 
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P<0  001 


(control) 

Figure  5 

Inhibition  of  CBF-A  transcripts  by  siRNA  downregulates  expression 
of  FSP1  transcripts.  Three  synthetic  ribonucleotides  were  transfected 
into  3T3  fibroblasts,  and  the  effect  was  monitored  by  qRT-PCR  at 
different  time  points.  The  Ct  values  for  transcripts  encoding  CBF-A 
and  FSP1  at  the  48-hour  time  point  obtained  in  triplicate  were  normal¬ 
ized  to  those  for  GAPDH.  As  siRNA3  did  not  attenuate  the  expression 
of  either  gene,  the  data  were  expressed  as  ratios  to  this  reference 
control;  siRNAI  best  inhibited  transcripts  encoding  CBF-A  and  FSP1 
compared  with  siRNA3;  P  <  0.001.  Presented  are  combined  results 
from  3  separate  experiments. 


in  the  activated  gene  encoding  a-SMA.  The  lymphoid  enhanc¬ 
ing  binding  factor  (LEF-1)  binding  site  is  also  shared  by  up-  and 
downregulated  genes,  reflecting  perhaps  the  context-dependent 
effect  of  LEF-1.  The  most  abundant  regulation  site  is  FTS-1, 
which  again  is  evenly  distributed  among  differentially  expressed 
genes  in  EMT.  It  is  likely  that  a  specific  combination  of  these 
sites  is  necessary  to  achieve  differential  expression  in  the  EMT 
transcriptome.  Accordingly,  the  promoter  regions  of  Twist, 
Snail,  HMGA2,  LEF-1,  and  Etsl  each  contain  FTS-1  sites,  but 
not  CArG-  or  E-box  motifs  (Table  2).  Our  findings  impli¬ 
cate  FTS-1  as  a  critical  site  that  in  combination  with  other 
as-acting  elements  regulates  the  EMT  transcriptome  across  a 
broad  range  of  promoters. 

In  summary,  this  study  reveals  a  mechanism  for  transcrip¬ 
tional  activation  of  EMT  using  the  interaction  of  FTS-1  sites 
with  CBF-A  and  KAP-1.  The  finding  that  the  CBF-A/KAP-1/ 
FTS-1  complex  is  an  activator  of  the  genes  encoding  the  EMT 
proteome  suggests  that  it  is  an  early  proximal  regulator,  if  not 
a  candidate  master  gene,  in  the  molecular  program  leading  to 
fibroblast  formation.  This  discovery  has  important  implica¬ 
tions  for  EMT-dependent  biological  processes,  including  organ 
fibrosis  and  metastatic  cancer.  CBF-A  may  be  a  good  target  can¬ 
didate  for  suppressing  EMT  under  pathologic  circumstances. 


Figure  6 

This  schematic,  based  on  assumptions  derived  from  the  current 
data,  shows  CBF-A  as  a  proximal  transcription  factor  induced 
by  outside-inside  signaling.  Migration  to  the  nucleus  causes  the 
formation  of  CBF-A/KAP-1 /FTS-1  binding  complexes,  which  can 
activate  other  known  transcriptional  regulators  of  EMT,  including 
Snail,  Twist,  FIMGA2,  LEF-1,  and  Ets-1.  These  are  not  the  only 
EMT-related  genes  that  are  activated  by  CBF-A/KAP-1/FTS-1 
binding  complexes  (see  Table  2),  but  they  could  account  for  the 
activation  of  most  of  the  known  and  well-studied  transcriptional 
regulators  of  EMT.  ILK,  integrin-linked  kinase. 


Methods 

Cell  cultures.  NIH  3T3  cells  were  used  as  a  source  of  mouse  fibroblasts.  Cells 
of  the  kidney  proximal  tubular  epithelial  cell  line  MCT  were  derived  from 
SJL/J  mice  (58).  mlMCD  cells  were  a  gift  from  Raymond  Harris  (Vanderbilt 
University,  Nashville,  Tennessee,  USA).  Cell  cultures  were  maintained  as 
described  previously  (6).  For  induction  of  EMT  in  culture,  epithelial  cells 
were  treated  with  3  ng/ml  of  TGF-[3  and  10  ng/ml  of  EGF  in  standard  cul¬ 
ture  media  (6).  The  progress  of  EMT  was  monitored  by  phenotypic  obser¬ 
vation,  cell  motility  assays,  PCR,  immunoblotting,  or  immunochemistry  at 
different  time  points  for  up  to  5  days  after  induction. 

UUO.  Three-month-old  male  BALB/c  mice  (The  Jackson  Laboratory) 
or  FSP1.GFP  mice  on  a  BALB/c  background  were  anesthetized  accord¬ 
ing  to  our  surgical  protocol,  and  the  right  ureter  was  ligated  at  2  points 
(4).  Both  fibrotic  and  normal  contralateral  kidneys  were  harvested  12 
days  after  surgery  and  either  snap-frozen  in  liquid  nitrogen  or  fixed  in  4% 
of  paraformaldehyde.  RNA  was  extracted  from  crushed  frozen  tissue  by 
homogenization  in  TRIzol  (Invitrogen),  and  after  DNAse  I  treatment  and 
inhibition  (Ambion),  5-pg  aliquots  of  RNA  were  used  in  a  reverse  tran¬ 
scription  reaction  with  Superscript  reverse  transcriptase  (Invitrogen). 
The  resulting  cDNA  was  used  as  template  for  qRT-PCR  analysis.  Prim¬ 
ers/probes  for  CBF-A,  FSP1,  E-cadherin,  a-SMA,  KAP-1,  and  GAPDH  were 
obtained  from  TaqMan  Gene  Expression  Assays  (Applied  Biosystems). 
Gene  quantification  was  performed  in  triplicate  on  the  Applied  Biosystems 
7900HT  Fast  Real-Time  PCR  System.  Data  were  expressed  in  threshold 
cycle  (Ct)  values  normalized  against  corresponding  GAPDH  expression. 
The  Institutional  Animal  Care  and  Use  Committee  (LACUC)  at  Vanderbilt 
University  approved  all  animal  studies. 

Immunohistochemistry,  immunofluorescence,  and  immunoblotting.  Polyclonal 
anti-KAP-1  antibody  was  prepared  by  Frank  Rauscher  (Wistar  Institute, 
Philadelphia,  Pennsylvania,  USA);  the  antibody  against  CBF-A  was  kindly 
provided  by  Jonathan  Dean,  Imperial  College  of  Science,  London,  United 
Kingdom;  anti-pan-cadherin  and  anti-cc-SMA  antibodies  were  from 
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Table  2 

Selected  promoter  sites  in  the  EMT  transcriptome 


Genes 

FTS-1 

CArG 

E-box 

LEF-1 

a-Skeletal  actin 

3 

- 

- 

- 

a-SMA 

1 

1 

- 

1 

Annexin  8 

1 

- 

- 

- 

p-Catenin 

2 

- 

- 

- 

al(l)  collagen 

- 

- 

- 

1 

a2(l)  collagen 

3 

2 

- 

- 

Desmoplakin 

2 

- 

- 

E-Cadherin 

1 

1 

1 

1 

Ets-1 

3 

- 

- 

- 

LEF-1 

1 

- 

- 

- 

FSP1 

2 

- 

- 

- 

Smad2,  Smad3 

- 

- 

- 

- 

Gelsolin 

1 

- 

1 

- 

HMGA2 

1 

- 

- 

- 

Laminin  B3 

2 

- 

- 

1 

Lamin  A 

3 

- 

- 

- 

MMP-9 

2 

- 

- 

- 

Moesin 

5 

- 

- 

- 

Rho 

3 

- 

- 

1 

Snail 

3 

- 

- 

1 

Slug 

- 

- 

- 

- 

Twist 

4 

- 

- 

- 

Vimentin 

4 

- 

- 

- 

ZO-1 

3 

- 

- 

Sigma- Aldrich;  antibodies  against  Collagen  I,  Vimentin,  Twist,  Snail- 1, 
N-cadherin,  ZO-1,  |3-catenin,  and  E-cadherin  were  from  Santa  Cruz 
Biotechnology  Inc.  Immunohistochemistry,  immunofluorescence,  and 
immunoblotting  were  performed  as  described  previously  (13,  59).  Prop- 
idium  iodide  (0.1  pg/ml)  (Molecular  Probes;  Invitrogen)  and  1.0  x  10-8  M 
TRITC-conjugated  phalloidin  (Sigma- Aldrich)  were  used  to  stain  nuclei 
and  filamentous  actins.  Isotype  variants  were  included  as  a  negative  con¬ 
trol.  Confocal  fluorescence  intensity  was  measured  using  the  NIH  Image 
program  (http://rsb.info.nih.gov/nih-image).  Image  and  data  analysis 
were  performed  using  the  Vanderbilt  University  Medical  Center  Cell 
Imaging  Core  Resource. 

ChIP.  ChIP  assays  were  performed  using  reagents  from  Upstate  USA 
Inc.  according  to  the  manufacturer’s  protocol  with  the  following  modi¬ 
fications.  Cross-linking  by  formaldehyde  (1%  vol/vol)  was  done  directly  in 
cultured  fibroblasts  or  MCT/rCBF-A  cells  for  10  minutes  at  37°C  and  ter¬ 
minated  by  glycine  at  a  0.125-M  final  concentration.  The  cells  were  washed 
twice  with  PBS  containing  proteinase  inhibitors  (1  mM  PMSF,  1  mg/ml 
aprotinin,  and  1  mg/ml  pepstatin  A).  Nuclei  were  obtained  using  a  nuclear 
purification  kit  (NUC-101;  Sigma- Aldrich)  supplemented  with  the  same 
proteinase  inhibitors  according  to  the  manufacturer’s  instructions.  For 
each  cell  variant,  3  aliquots  of  106  nuclei  were  resuspended  in  SDS  lysis 
buffer  (1%  SDS,  10  mM  EDTA,  50  mM  Tris-HCl,  pH  7.9)  plus  protease 
inhibitor  cocktail  for  mammalian  tissues  (Sigma-Aldrich).  The  nuclei 
were  sonicated  to  yield  DNA  fragments  ranging  from  0.1  to  1  kb.  Follow¬ 
ing  centrifugation  at  10,000  x  g  for  10  minutes  at  4°C,  one  portion  of  the 
lysates  was  analyzed  for  DNA  fragment  size  after  reversing  the  cross-links 
and  purification.  The  remaining  supernatants  were  diluted  10-fold  with 
ChIP  dilution  buffer  and  incubated  by  rotation  with  80  pi  of  50%  salmon 
sperm  DNA/protein  A  agarose  beads  for  90  minutes  at  4°C.  The  beads 
were  removed  by  brief  centrifugation  at  4°C,  and  10  pi  of  either  rabbit 
preimmune  serum  or  rabbit  CBF-A  antibody  was  added  to  the  lysates  and 


incubated  by  rotation  overnight  at  4°C.  The  samples  were  subsequently 
incubated  with  60  pi  of  protein  A  agarose  beads  for  1  hour  at  4°C,  and 
the  beads  were  recovered  by  brief  centrifugation  and  washed  according  to 
the  manufacturer’s  protocol:  once  with  low-salt  0.15  M  NaCl  buffer,  once 
with  high-salt  0.5  M  NaCl  buffer,  and  once  with  0.25  M  LiCl  wash  buf¬ 
fer.  The  pellets  were  washed  once  with  10  mM  Tris-HCl  and  1  mM  EDTA, 
pH  8.0  (Tris-EDTA)  and  resuspended.  The  samples  were  split  in  2,  and  the 
beads  were  collected  by  centrifugation;  one  portion  was  resuspended  in 
SDS  loading  buffer,  boiled  for  10  minutes,  and  loaded  on  a  ready-made 
4%-20%  SDS  gradient  gel  (Bio-Rad)  for  Western  blot  analysis.  The  second 
portion  for  PCR  analysis  was  resuspended  in  Tris-EDTA  plus  0.2  M  NaCl 
and  incubated  overnight  at  65  °  C  to  reverse  protein-DNA  cross-links.  This 
portion  was  used  directly  as  a  template  for  PCR  analysis  with  primers  pro¬ 
ducing  an  approximately  280-bp  fragment  encompassing  the  FTS-1  site  in 
both  directions;  custom  FTS-1  primers  (Invitrogen)  were:  5'-GGTGGTAG- 
ATATTCTGCTCC-3'  and  3'-TCTGCAACAACTCCTTGAGC-5'. 

EMSA.  Nuclei  were  purified  from  cultured  cells  as  described  above  and 
either  frozen  or  extracted.  Nuclear  extracts  were  prepared  in  400  mM 
KCl  buffered  with  20  mM  HEPES  pH  7.9  supplemented  with  a  cock¬ 
tail  of  proteinase  inhibitors  (Sigma-Aldrich).  The  protein  extracts  were 
brought  down  to  100  mM  KCl,  and  DNA  binding  activity  was  examined 
using  the  following  oligonucleotide  probes:  FTS-1  (Invitrogen),  regular: 
5'CACTCACTACTTGATTGTGCCTGCT-3',  mutated:  5'-CACTCAC- 
TACCCAGCTGTGCCTGCT-3';  YB-1  consensus  site  (Santa  Cruz  Biotech¬ 
nology  Inc.),  regular:  5'-AGACCGTACGTGATTGGTTAATCTCTT-3', 
mutated:  5'-AGACCGTACGTGTAACCATAATCTCTT-3';  SRE  (Santa 
Cruz  Biotechnology  Inc.),  regular:  5,-GGATGTCCATATTAGGACATCT-3/, 
mutated:  5  '-GGAT GTCC ATATTATTAC AT CT-3 ' .  EMSA  was  performed 
as  described  previously  (60).  Briefly,  10-15  pg  protein  was  incubated  with 
4  x  105  cpm  of  [32P]ATP  end-labeled  oligonucleotides  in  20  mM  HEPES 
pH  7.9  with  100  mM  KCl,  20%  glycerol,  and  0.2  mM  EDTA.  For  competi¬ 
tion  analyses,  protein  was  preincubated  with  unlabeled  oligonucleotides 
or  antibodies.  FTS-1  complexes  were  resolved  by  5%  PAGE  using  0.5x  Tris- 
borate  buffer  pH  7.8,  vacuum  dried,  and  subjected  to  Phosphorlmager 
analysis  (GE  Healthcare).  Two-dimensional  EMSA/SDS-PAGE  gels  were 
run  with  a  nonradiolabeled  probe;  the  position  of  the  complex  in  the  first 
dimension  was  deduced  from  radiolabeled  duplicates,  and  the  gel  piece 
was  cut  and  inserted  on  top  of  the  SDS  gel.  As  a  negative  control,  nuclear 
extract  without  DNA  was  run  in  parallel  in  the  first  dimension  (EMSA), 
and  the  area  corresponding  to  the  complex  was  cut  and  run  together  with 
the  real  complex  in  the  subsequent  second  dimension  (SDS-PAGE).  For 
mass  spectrometry  analyses,  complexes  of  unlabeled  FTS-1  were  identified 
by  reversible  silver  staining  (SilverQuest;  Invitrogen)  and  matched  to  radio- 
labeled  samples  run  in  parallel.  In  each  experiment,  stained  complexes  were 
cut  from  the  gel  and  subjected  to  trypsinization  before  analysis. 

DNA  affinity  chromatography.  Regular  and  mutated  100-bp  biotinyl¬ 
ated  tetramers  of  the  25-bp  FTS-1  oligonucleotide  were  custom  made 
by  Invitrogen  and  attached  to  Dynabeads  M-280  Streptavidin  (DYNAL; 
Invitrogen).  FTS- 1-specific  complexes  were  formed  with  fibroblast  nucle¬ 
ar  extracts  at  the  conditions  described  for  EMSA.  The  nuclear  extracts 
were  incubated  first  with  the  mutated  tetramer,  and  the  flow-through 
was  used  for  a  secondary  incubation  with  the  normal  FTS-1  tetramer. 
The  beads  were  washed  3  times  with  incubation  buffer,  and  fractions 
were  eluted  with  0.5  M  KCl  followed  by  the  same  buffer  plus  5  M  urea. 
The  eluates  were  combined,  dialyzed  against  10  mM  Tris-HCl  pH  7.5,  and 
analyzed  by  EMSA  for  their  ability  to  form  the  FTS-1  complex.  Positive 
fractions  were  trypsinized  and  subjected  to  tandem  mass  spectrometry 
for  peptide  identification. 

Mass  spectrometry  analysis.  Peptide  mixtures  were  separated  and  analyzed 
using  microcapillary  reverse-phase  HPLC  (RP-HPLC)  coupled  to  tandem 
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mass  spectrometry  (25).  Peptide  mixtures  were  loaded  onto  a  75-pm  inter¬ 
nal  diameter  RP-HPLC  column  (Poros  R2;  Perceptive  Biosystems)  equili¬ 
brated  in  0.5%  acetic  acid  and  eluted  using  a  linear  gradient  of  0%-40% 
acetonitrile  over  60  minutes  followed  by  40%-60%  over  10  minutes  at  a 
flow  rate  of  0.3  pl/min.  Eluting  peptides  were  analyzed  by  electrospray  ion¬ 
ization  mass  spectrometry  using  an  ion  trap  mass  spectrometer  (Finnigan 
LCQ  Deca;  Thermo  Scientific).  All  tandem  spectra  were  searched  against 
the  NCBI  nonredundant  protein  database  (http://www.ncbi.nlm.nih.gov/ 
RefSeq/)  using  the  SEQUEST  algorithm  (25,  61). 

Generation  of  stable  MCT/rCBF-A  cell  lines.  A  mammalian  CBF-A/CBF-A 
expression  vector,  pSR-CBFA-N,  was  amplified  using  the  Advantage  PCR 
system  (Clontech)  with  the  following  primers:  the  5'  primer  included  an 
overhang  with  a  unique  Bglll  restriction  site  (ATTAGATCTGAGTAGCC- 
GCTGCGGCCTGGCCGACCATGT)  and  the  3'  primer  (ATTCTC- 
GAGTCACTCGGCCGCTCCTGCTGCCTCTCAGTA)  added  a  unique 
Xhol  site.  Following  restriction  digestion,  the  sequence  was  subcloned 
into  a  pcDNA5/TO  vector  from  the  T-REx  tetracycline  inducible  system 
(Invitrogen),  producing  the  pcDNA5/TO/rCBF-A  vector.  MCT  cells  were 
transfected  simultaneously  with  both  the  pcDNA5/TO/rCBF-A  vector 
and  pcDNA6/TR  regulatory  vector  of  the  T-REx  system  using  SuperFect 
reagent  (QIAGEN).  Seven  micrograms  of  TR  plasmid  and  3  pg  of  rCBF-A 
plasmid  were  used  to  insure  higher  copy  number  of  the  regulatory  fac¬ 
tor.  Following  transfection,  cells  were  switched  to  media  with  tetracycline- 
free  serum  (Clontech).  Twenty-four  hours  after  transfection,  recombinant 
cells  were  selected  using  5  pg/ml  blasticidin  (Invitrogen).  Once  colonies 
formed,  cells  were  tested  for  the  presence  of  both  vectors  using  the  prim¬ 
ers  5'-AGAGCTGCTTAATGAGGTCG-3'  and  5'-GCATTATATGCACT- 
CAGCGC-3'  for  the  TR  vector;  and  5 ' -T GT C AGT G G AAG C AAGT GTG-3 ' 
and  5 '  -  ACTAG  AAG  G  C  AC  AGT  C  G  AG  G-3 '  (vector  BGH  3'-UTR  specific) 
for  rCBF-A.  Cells  were  treated  with  1  pg/ml  doxycycline  to  induce  rCBF-A 
and  tested  for  changes  in  gene  expression. 

PCR  analysis.  All  primers  were  custom  made  by  Invitrogen:  FSP1:  sense, 
5'-ATGGCAAGACCCTTGGAGGA-3',  anti-sense,  5 '  -C  ATT  G  C  AC  AT  CAT  - 
GGCAATG-3';  GAPDH:  sense,  5'-GCAGTGGCAAAGTGGAGATT-3',  anti- 
sense,  5'-GCAGAAGGGGCGGAGATGAT-3';  rCBF-A.  sense,  5'-TGTCAGT- 
GGAAGCAAGTGTG-3';  anti-sense,  5'-ACTAGAAGGCACAGTCGAGG-3' 
(derived  from  BGH  3'-UTR  of  the  vector).  Reverse  transcription  PCR  was 
performed  using  5  pg  of  total  DNAse  I-treated  RNA  with  Superscript 
reverse  transcriptase  (Invitrogen).  Aliquots  of  cDNA,  2  pg  or  0.2  pg,  were 
subjected  to  PCR  for  40  cycles,  with  10-pl  aliquots  removed  at  20,  25,  30, 
35,  and  40  cycles.  Fragments  were  separated  in  agarose  gels  and  stained 
with  ethidium  bromide,  and  the  images  were  analyzed  using  Scion  Image 
(Scion  Corp.).  For  qRT-PCR  analysis,  total  RNA  was  collected  at  different 
time  points,  treated  with  DNAse  I,  and  converted  by  reverse  transcription. 
Primers/probes  for  CBF-A,  FSP1,  and  GAPDH  were  from  TaqMan  Gene 
Expression  Assays  (Applied  Biosystems).  Gene  quantification  was  per¬ 
formed  on  the  Applied  Biosystems  7900HT  Fast  Real-Time  PCR  System. 
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siRNA  inhibition.  The  Silencer  Pre-designed  siRNA  oligonucleotides 
(Ambion)  targeted  exons  2  and  3  or  exon  4  of  CBF-A/CBF-A.  These  exons 
are  not  subject  to  alternative  splicing.  Each  siRNA  was  separately  trans¬ 
fected  into  3T3  fibroblasts  using  Lipofectamine  2000  (Invitrogen)  fol¬ 
lowing  the  manufacturer’s  instruction  for  transfection.  Negative  control 
siRNA  (Ambion)  was  transfected  in  parallel  for  comparison.  The  levels  of 
CBF-A  and  FSP1  expression  were  monitored  at  different  time  points  after 
transfection  by  qRT-PCR  using  extracted  total  RNA  and  primers/probes  as 
described  above.  The  Ct  values  obtained  were  normalized  against  the  cor¬ 
responding  Ct  values  of  GAPDH.  The  results  from  3  separate  experiments 
were  examined  for  statistical  differences.  siRNA3  was  selected  as  a  negative 
control  due  to  its  lack  of  effect  on  CBF-A  expression. 

Motility  assay.  Migration  through  a  membrane  in  response  to  a  serum  gra¬ 
dient  (Boyden  chamber  assay)  was  performed  in  24-well  culture  plates  with 
8-pm  pore  size  inserts  (BD).  Cells  were  grown  for  5  days  with  or  without 
doxycycline.  Cells  (2  x  101 2 3 4 5)  in  0.1%  BSA-containing  medium  were  loaded 
onto  the  top  of  each  membrane.  The  bottom  wells  contained  15%  FBS. 
When  needed,  doxycycline  was  added  into  both  wells.  The  plates  were  incu¬ 
bated  for  12  hours  at  37°  C;  membranes  were  removed  and  stained  with 
Diff-Quik  stain  (IMEB).  Cells  were  monitored  in  each  field  by  a  digital 
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Statistics.  All  values  were  expressed  as  mean  ±  SEM.  Mann-Whitney  U  tests 
were  used  to  compare  the  mean  of  non-parametric  groups.  Values  of  P  less 
than  0.05  were  considered  statistically  significant. 
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The  KRAB  domain  is  a  75  amino  acid  transcriptional  repression  module  that 
is  encoded  by  more  than  400  zinc  finger  protein  genes,  making  it  the  most 
abundant  repression  domain  in  the  human  proteome.  KRAB-mediated  gene 
silencing  requires  a  direct  high  affinity  interaction  with  the  RBCC  domain  of 
KAP-1  co-repressor.  The  structures  of  the  free  KRAB  domain  or  the  KRAB- 
RBCC  complex  are  unknown.  To  address  this,  we  have  performed  a 
systematic  biophysical  analysis  of  all  KRAB  isoforms  using  purified 
recombinant  proteins.  All  KRAB  domains  are  predominantly  monomeric 
either  alone  or  in  a  complex  with  KAP-l-RBCC  protein,  while  a  KRAB- 
SCAN  isoform  exists  as  a  stable  dimer.  The  KRAB:KAP-1-RBCC  interaction 
requires  only  the  A  box  in  the  context  of  the  KRAB(Ab)  or  KRAB(AC)  but 
both  A  and  B  boxes  in  the  context  of  KRAB(AB).  All  isoforms  bind  the  KAP- 
l-RBCC  in  a  stable,  zinc  dependent  fashion  with  a  stoichiometry  of 
KRAB1:3  RBCC  with  a  zinc  content  of  four  atoms  per  RBCC  monomer. 
Limited  proteolysis,  mass  spectrometry  and  N-terminal  sequence  analyses 
suggest  that  a  core  complex  comprises  the  entire  RBCC  domain  of  KAP-1 
and  the  AB  box  of  the  KRAB  domain  rendering  it  resistant  to  proteolysis. 
NMR  spectroscopy  showed  that  unbound  KRAB  domain  does  not  exist  as  a 
well-folded  globular  protein  in  solution  but  may  fold  into  an  ordered 
structure  upon  binding  to  the  KAP-l-RBCC  protein.  This  is  the  first 
example  of  a  structurally  disordered  repressor  domain  that  is  the  most 
widely  conserved  silencing  domain  in  tetrapods. 

©  2007  Elsevier  Ltd.  All  rights  reserved. 

Keywords:  transcriptional  repression;  KRAB  domain;  KAP-1;  protein- 
protein  interaction;  structurally  disordered  protein 


Introduction 

Biological  processes  such  as  transcriptional  reg¬ 
ulation,  translation,  and  cellular  signal  transduction 


Abbreviations  used:  KRAB  domain,  Kruppel  associated 
box;  ZFP,  zinc  finger  protein;  KAP-1,  KRAB  associated 
protein  1;  RBCC  domain,  RING  finger,  B  boxes,  and 
coiled-coil  region;  Ni-NTA,  Ni2+-intrilo-tri-acetic  acid; 
DLS,  dynamic  light  scattering;  ICP,  inductivity  coupled 
plasma;  HSQC,  heteronuclear  single  quantum  coherence. 
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depend  on  the  specific  and  temporal  targeting  of 
macromolecular  complexes.  The  proteins  involved 
in  these  processes  are  composed  of  multiple  in¬ 
dependently  folded  globular  domains  that  coop¬ 
erate  in  macromolecular  recognition.  These  domains 
are  usually  recognizable  by  highly  conserved  sig¬ 
nature  sequence  motifs  and  their  presence  and 
organization  within  a  protein  sometimes  provides 
clues  into  the  protein's  function.  Modular  motifs 
within  multi-domain  proteins  often  specify  specific 
interacting  partners  and  the  identification  of  these 
interactions  has  been  beneficial  in  defining  biochem¬ 
ical  functions  of  particular  proteins  and  their  cellular 
pathways.  Sequence  homology  prediction  alone  to 
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identify  domain  or/and  motif  type  may  lead  to 
ambiguous  assignments  of  domain  type,  making 
prediction  of  function  a  difficult  task.  Therefore, 
biochemical  and  biophysical  characterization  and 
determination  of  dynamics  and  the  three-dimen¬ 
sional  structure  information  are  often  essential  for 
understanding  biological  function,  especially  for 
the  domains  that  lack  intrinsic  globular  structure 
under  physiological  conditions  and  fold  into  an 
ordered  structure  upon  binding  to  their  targets. 
With  these  principles  in  mind,  we  have  been  char¬ 
acterizing  the  Kruppel-associated  box  (KRAB)  do¬ 
main  of  the  KRAB-zinc  finger  protein  (KRAB-ZFP) 
family. 

Modular  transferable  repression  domains  have 
emerged  as  a  set  of  highly  conserved  structural 
motifs  in  families  of  transcription  factors.  These 
conserved  repression  domains  include  the  BTB/ 
POZ,  KRAB,  SCAN  and  SNAG  modules.  The  BTB/ 
POZ  domain  mediates  homodimerization  and  het¬ 
erodimerization  between  BTB/POZ  proteins.1,2 
The  structure  of  the  BTB/POZ  domain  was  deter¬ 
mined  and  it  exists  as  a  homodimer." 4  The  SCAN 
domain  functions  as  a  protein-protein  interaction 
domain,  mediating  self-association  or  selective 
association  with  other  proteins. 5,6  The  isolated 
SCAN  domain  forms  a  stable  dimer  in  solution/ 
and  three-dimensional  structure  of  this  dimer 
from  the  human  ZNF174  protein  was  recently  de¬ 
termined.8  We  focused  on  the  KRAB  domain 
as  a  model  system  for  the  analysis  of  repression 
modules.9-12  Analysis  of  the  human  genome  se¬ 
quence  revealed  423  independent  KRAB-ZFP 
genes,  yielding  alternative  transcripts  that  alto¬ 
gether  predict  at  least  742  structurally  distinct  pro¬ 
teins.13  The  KRAB  domain  is  exclusively  found 
in  the  NFI2  termini  of  Krupple-related  ZFPs  and 
it  is  classically  divided  into  an  A  and  a  B  box. 
KARB-ZFPs  can  be  classified  into  several  subfami¬ 
lies:  the  first  containing  both  a  classical  A  and  a 
classical  B  motif  (KRAB(AB)),  the  second  contain¬ 
ing  a  classical  A  motif  and  a  highly  divergent  b 
motif  (KRAB(Ab)),  and  the  third  containing  a 
classical  A  motif  and  another  highly  divergent  C 
motif  (KRAB(AC))  (Figure  l).14  Some  KRAB-ZFPs 
contain  only  an  A  box  (KRAB(A)).  In  addition  to 
the  KRAB  domain,  some  KRAB-ZFPs  also  carry  a 
SCAN  or  leucine-rich  (LeR)  domain  (Figure  1). 
Thus,  the  KRAB  domain  may  represent  structurally 
and  functionally  divergent  properties  relevant  to  its 
repression. 

The  KRAB  domain  was  shown  to  be  a  potent, 
DNA  binding-dependent  transcriptional  repression 
module.10,15  The  spatial  and  temporal  expression  of 
KRAB-ZFPs  suggests  that  their  biological  functions 
could  be  involved  in  regulation  of  embryonic 
development,  cell  differentiation,  and  cell  transfor¬ 
mation.  16-19  Some  KRAB-ZFPs  are  mainly  restricted 
to  lymphoid  cells  and  may  play  a  specific  role  in 
lymphoid  differentiation,  whereas  others  are  ex¬ 
pressed  and  specifically  down-regulated  during 
myeloid  differentiation.  Similarly,  a  recent  study 
assigned  a  biological  function  in  controlling  sexual 


dimorphic  liver  gene  expression  to  a  set  of  KRAB- 
ZFPs  (Rsll  and  Rs/2).20  A  number  of  KRAB-ZFPs 
are  candidate  genes  for  human  diseases  based  on 
their  chromosomal  locations.21,22  Mutations  of  two 
KRAB-ZFP  genes  (ZNF41  and  ZNF81)  clustered 
on  the  X  chromosome  are  found  in  patients  with 
mental  retardation,  suggesting  they  are  critical  for 
development. 23,24 

Transcriptional  repression  by  the  KRAB  domain 
correlates  with  its  binding  to  KAP-1  (KRAB 
associated  protein  1),  also  referred  to  as  TIFlfJ  or 
KRIP-1.  KAP-1  belongs  to  a  family  of  transcription 
co-regulators,  which  includs  TIFla,  TIFly,  TIF18, 
Bonus,  and  Ectodermin. 9,25-29  The  TIF1  family 
encodes  the  signature  RING-B  box-coiled-coil 
tripartite  motif  (RBCC  motif)  at  its  NFI2  terminus, 
and  a  PFID-bromodomain  at  its  COOH  terminus. 
The  RBCC  motif  likely  functions  as  a  cooperative 
protein/protein  interaction  motif.12,30,31  We  have 
characterized  the  KRAB:KAP-1  interaction  exten¬ 
sively  and  this  analysis  has  revealed  that  the 
RBCC  functions  as  a  protein  interaction  do¬ 
main.11,12  The  KRAB  domain  binds  directly  with 
high  affinity  to  the  RBCC  domain  of  KAP-1.  This 
binding  requires  all  three  sub-domains  of  the 
RBCC  domain  and  the  mutation  of  any  individual 
sub-domain  abolishes  binding.  The  KAP-l-RBCC 
domain  oligomerizes  as  a  homo-trimer,  and  oligo¬ 
merization  is  obligate  for  KRAB  binding.  The  stoi¬ 
chiometry  of  the  KRAB:KAP-1  complex  is  1:3.  In 
vivo,  KRAB-mediated  gene  silencing  is  absolutely 
dependent  on  KAP-1  binding.9  The  PXVXL  motif 
of  KAP-1  directly  binds  heterochromatin  protein 
HP1. 32,33  The  PFID  and  bromodomain  of  KAP-1 
recruits  the  NURD  histone  deacetylase  complex, 
SETDB1  histone  H3-K9  methyl  transferase,  and 
other  modification  mechanisms  involved  in  gene- 
silencing  (A.  Ivanov,  unpublished  data).34-36 
Although  the  KRAB-KAP-l-mediated  silencing 
mechanism  has  been  well  characterized  compara¬ 
tively  and  the  structural  feature  and  properties  of 
KRAB  domain  has  been  investigated,37  the  three- 
dimensional  structure  of  KRAB  domain  remains  to 
be  determined. 

It  has  been  well  established  that  the  functions  of 
many  proteins  are  directly  related  to  their  three- 
dimensional  structure.  Flowever,  many  proteins 
lack  intrinsic  globular  structure  under  physiological 
conditions."8  Such  proteins  are  often  involved  in 
important  regulatory  functions  within  the  cell  and 
fold  into  an  ordered  structure  upon  binding  to  their 
target.  An  intrinsic  lack  of  structure  can  confer  func¬ 
tional  advantages  to  a  protein.  There  are  numerous 
examples  of  domains,  such  as  the  kinase-inducible 
activation  domain  of  the  CREB  and  the  acidic  ac¬ 
tivation  domain  of  p53  and  VP16,  that  are  unstruc¬ 
tured  in  solution  but  become  structured  upon 
binding  to  their  target  proteins:  CBP,  MDM2,  and 
TAFII31,  respectively.39-42 

We  have  performed  a  comprehensive  and  sys¬ 
tematic  analysis  for  the  KRAB  domains  using  bio¬ 
chemical,  biophysical,  and  structural  approaches. 
Our  data  indicate  that  the  KRAB  domain  exists  as  a 
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Figure  1.  A  diagram  illustrating  the  architecture  of  KRAB-ZFPs  and  recombinant  derivatives  used  in  this  study.  The 
numbers  represent  amino  acid  position.  A,  A  box;  B,  B  box;  b,  b  box;  C,  C  box;  ZNF  region,  zinc  finger  region;  SCAN, 
SCAN  domain.  The  column  to  the  right  indicates  the  expressed  proteins.  Database  accession  numbers:  KOX1 
(NM_015394);  AJ18(NM_023988);  MTZ1(L28167);  MZF13(AF242376);  ZNF202(NM_003455);  ZNF274(NM_016325). 


monomer.  One  molecule  of  KRAB  binds  three 
molecules  of  KAP-l-RBCC.  We  also  show  that  the 
KRAB  domain  is  a  structurally  disordered  in 
solution  and  it  actively  binds  to  its  ligand,  the 
RBCC  domain  of  KAP-1.  These  data  may  have 
broader  implications  for  studying  other  tran¬ 
scriptional  repressors  and/or  activators  with  their 
co-factors. 


Results 

Structure  and  sequence  analyses  of  the 
KRAB-ZFPs  family 

KRAB-ZFP  genes  are  very  abundant  in  the 
human  genome.  We  began  our  studies  by  aligning 
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and  analyzing  the  protein  sequence  of  the  KRAB- 
ZFPs.  The  KRAB  domain  homology  consists  of  ~  75 
amino  acid  residues  and  is  predicted  to  fold  into  two 
amphipathic  helices  that  are  involved  in  protein- 
protein  interaction  (Figure  l).43  The  analyses  indi¬ 
cate  that  KOXl-KRAB(AB)  displays  54%  identity 
(75%  similarity)  with  AJ18-KRAB(Ab),  and  55% 
identity  (80%  similarity)  with  MZF13-KRAB(AC), 
while  AJ18-KRAB(Ab)  displays  51%  identity  (82% 
similarity)  with  MZF13-KRAB(AC)  over  the  whole 


KRAB  domain.  The  high  degree  of  identity  and 
similarity  among  the  subfamilies  is  exhibited  only  in 
the  A  box  and  no  significant  similarity  was  found 
between  the  other  boxes  (Figure  2(a)).  The  presence 
of  B,  b,  or  C  boxes  may  influence  the  specific  ac¬ 
tivities  of  different  KRAB  domains  and  the  bind¬ 
ing  to  the  KAP-1  protein.44  Flowever,  the  structural 
basis  of  the  B,  b  or  C  boxes  in  the  recognition  of 
their  physiologically  relevant  ligand  remains  to  be 
determined. 
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Figure  2.  (a)  Sequence  alignment  of  the  KRAB  domain  and  SCAN  domain  from  KRAB-ZFPs.  Numbers  refer  to 
amino  acid  positions  in  the  corresponding  proteins.  The  KRAB  and  SCAN  domains  consensus  residues  are  highlighted  in 
black.  The  periods  represent  spaces  introduced  to  obtain  maximal  alignment,  (b)  The  CD  spectra  of  the  KRAB(Ab)  of  AJ18 
and  KRAB(AB)-SCAN  of  ZNF274.  CD  spectra  were  carried  out  in  a  buffer  containing  20  mM  Hepes  (pH  7.5),  100  mM 
NaCl,  1  mM  TCEP.  A  CD  spectrum  for  a  AJ18-KRAB(Ab)  protein  containing  65%  random  coil  and  35%  helix  is  plotted 
(black).  A  CD  spectrum  for  a  ZNF274-KRAB(AB)-SCAN  protein  containing  56%  random  coil,  39%  helix,  and  5%  p  sheet 
is  plotted  (red). 
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The  program  Predictproteint  suggests  that  the 
KRAB  domain  possesses  a  helical  content  of  ap¬ 
proximately  35%  with  two  major  helices  and  three 
loops.  These  helices  have  an  amphipathic  nature  as 
defined  by  helical  wheel  analysis.  These  predictions 
indicate  that  the  KRAB  domain  may  provide  an 
interface  for  protein-protein  interactions  and  that  it 
may  bind  to  the  RBCC  domain  of  KAP-1  via  a  helix/ 
helix  interaction. 

Expression  and  purification  of  recombinant 
KRAB  domain  proteins 

The  KRAB  domains  utilized  here  include  seven 
members  from  KRAB(AB),  two  from  KRAB(Ab), 
one  from  KRAB(AC),  one  from  SCAN-KRAB(A), 
and  one  from  KRAB(AB)-SCAN-KRAB(A)  subfa¬ 
milies  (Figure  1;  Table  1).  We  selected  the  KRAB 
domains  from  all  subfamilies  for  the  following 
reasons:  (1)  the  KOXl-KRAB(AB)  domain  was 
originally  utilized  to  isolate  the  KAP-1  co-repressor, 
and  mutations  in  this  domain  that  concomitantly 
abolish  repression  and  KAP-1  binding  are  well 
characterized;9  (2)  the  KOXl-KRAB(AB)  domain  is 
highly  expressed  in  Escherichia  coli,  and  is  well 
behaved  in  protein  reconstitution  assays;11,12  (3)  the 
GAL4-KOXl-KRAB(AB)  and  GAL4-SZF1-KRAB 
(AB)  fusion  is  a  potent,  KAP-1 -dependent,  and 
DNA  binding-dependent  transcriptional  repressor 
in  vivo;10,45  (4)  we  hypothesized  that  the  variant 
amino  acids  in  B,  b,  and  C  boxes  may  influence  the 
folding  of  the  KRAB  domain  so  that  the  solubility 
and  behavior  of  the  proteins  may  be  different. 

To  optimize  the  protein  expression  level  and 
solubility,  three  protein  expression  systems  were 
used  in  E.  coli.  A  baculovirus  expression  system  was 
utilized  as  well  in  the  event  that  post-translation 
modification  might  become  important  (Table  1).  All 
His-tagged  proteins  were  first  purified  using  Nr 1  - 
NTA  chromatography  under  native  conditions, 
followed  by  gel  filtration  chromatography.  Most  of 
the  His-tagged  KRAB(AB)  domain  proteins  were  not 
soluble  under  native  conditions  except  His-tagged 
AJ18-KRAB(Ab)  (Table  1).  The  GST-fusion  proteins 
were  purified  with  GST  resin,  followed  by  a  throm¬ 
bin  digestion  to  release  the  target  proteins.  Further 
purification  was  carried  out  using  ion-exchange  and/ 
or  gel  filtration  chromatography  (data  not  shown). 
GST-AJ18-KRAB(Ab)  and  GST-ZNF274-KRAB(AB) 
were  highly  soluble  and  stable  under  native  condi¬ 
tions.  SDS-PAGE  analyses  revealed  that  the  purified 
soluble  His-tagged  or  GST-fusion  proteins  migrated 
as  predicted  by  their  molecular  mass.  Production 
of  these  highly  soluble  proteins  allowed  us  to  con¬ 
duct  further  functional  and  structural  analyses. 

To  address  whether  the  highly  soluble  KRAB 
domain  proteins  were  structured,  dynamic  light 
scattering  (DLS)  analyses  were  performed  using  a 
DynaPro-801  molecular  sizing  instrument.  These 
analyses  showed  that  most  of  the  KRAB  proteins 
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display  moderate  polydispersity,  with  the  AJ18- 
KRAB(Ab)  protein  exhibiting  the  greatest  polydis¬ 
persity,  indicating  that  they  are  generally  prone  to 
aggregation.  CD  spectra  of  AJ18-KRAB(Ab),  and 
ZNF274-KRAB(AB)-SCAN  proteins  showed  that 
they  contain  mostly  random  coil  (56%  —  65%)  with 
a  small  degree  of  helix  (35%  ~  39%)  but  little  (’>  sheet 
(0  ~  5%)  (Figure  2(b)),  consistent  with  the  secondary 
structure  predicted  by  Predictprotein.  These  data 
suggest  that  the  KRAB  domain  alone  may  not  be 
highly  structured. 

The  KRAB  domain  of  ZFP  exists  in  a  monomeric 
state  in  solution 

We  previously  employed  biochemical  and  bio¬ 
physical  analyses  for  the  KOXl-KRAB(AB):KAP-l- 
RBCC  complex  and  its  components.  The  KOX1- 
KRAB(AB)  was  a  soluble  self-aggregate  under 
physiological  buffer  conditions.  The  non-aggregated 
portion  of  KOXl-KRAB(AB)  could  be  separated 
from  aggregate  by  gel  filtration  in  the  presence  of 
non-denaturing  detergent  (Figure  3(a),  I).  Although 
it  is  predominantly  monomeric  in  the  absence  of 
KAP-1,  this  non-aggregated  KOXl-KRAB(AB)  pro¬ 
tein  is  highly  active  for  binding  to  the  KAP-l-RBCC 
when  these  factors  co-exist  in  solution.  To  explore 
the  oligomeric  state  of  other  KRAB  domains,  gel 
filtration  chromatography  and  analytical  ultracen¬ 
trifugation  were  used.  Gel  filtration  studies  showed 
that  under  physiological  buffer  conditions,  the 
AJ18-KRAB(Ab)  protein  (calculated  Mr  of  monomer 
is  10  kDa)  eluted  in  a  single  peak  =  17  kDa,  consistent 
with  a  monomer  (Figure  3(a),  II).  In  contrast,  the 
ZNF274-KRAB(AB)-SCAN  protein  (calculated  Mr 
of  dimer  is  57.41  kDa)  eluted  in  a  single  peak  = 
130  kDa,  consistent  with  a  tetramer  or  an  elongated 
dimer  (Figure  3(a),  III). 

To  more  rigorously  establish  the  oligomeric  state 
of  the  KRAB  domains,  we  employed  equilibrium 
sedimentation  experiments.  Analyses  were  per¬ 
formed  at  4  °C  at  different  speeds  and  protein 
concentrations.  The  concentration  of  protein  versus 
radius  data  was  fitted  with  various  models  of  self¬ 
association  using  non-linear  regression.46  For  AJ18- 
KRAB(Ab)  protein,  the  data  were  best  described 
by  a  model  containing  predominantly  monomer 
(90%  ~  95%)  and  a  small  amount  of  dimer  with  Kd 
value  of  1.3  mM  at  speed  of  38,900  rpm,  and  with  Kd 
value  of  5.1  mM  at  speed  of  55,000  rpm  (Figure  3(b)), 
indicating  that  the  AJ18-KRAB(Ab)  protein  is 
predominantly  a  monomer.  For  the  ZNF274-KRAB 
(AB)-SCAN  protein,  the  data  were  best  fit  by  a 
model  containing  a  single  species  of  dimer  with  a  Mr 
of  57,701  kDa  at  speed  of  23,200  rpm  (Figure  3(c)). 
This  value  is  close  to  the  calculated  Mr  of  the  dimer 
57,412.  This  result  indicates  that  the  ZNF274-KRAB 
(AB)-SCAN  protein  is  a  dimer  and  suggests  that  the 
130  kDa  size  observed  from  gel  filtration  is  due  to  an 
asymmetric  shape.  The  data  also  suggests  that  the 
SCAN  domain  is  responsible  for  dimerization  and  is 
consistent  with  the  observation  that  the  SCAN 
domain  from  ZNF174  also  forms  a  dimer.8 
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Table  1.  Expression,  solubility,  and  function  for  the  KRAB  domain  of  KRAB- 
protein 

-ZNPs  and  the  RBCC  domain  of  KAP-1 

Expression  system  tag 

Proteins 

Solubility3 

KAP-l-bindingb 

KRAB  domain 

Intein 

KOXl-KRABIntein,  1-90 

+ 

+ 

6Histidine 

KOXl-KRAB(AB)HIS,  1-161 

- 

+  + 

KOXl-KRAB(AB)HIS,  1-90 

- 

++ 

KOXl-KRAB(AB)HIS,  12-76 

- 

++ 

SZF1-KRAB(AB)HIS,  3-110 

- 

+  + 

ZNF133-KRAB(AB)N-HIS,  1-119 

- 

ND 

ZNF133-KRAB(AB)C-HIS,  1-119 

- 

ND 

ZNF140-KRAB(AB)HIS,  1-119 

- 

ND 

ZNF141-KRAB(AB)HIS,  1-107 

- 

ND 

EEK1-KRAB(AB)HIS,  1-122 

- 

ND 

KRK1  -KRAB(  AB)HIS,  16-127 

- 

ND 

AJ 1 8-KRAB(  Ab  )HIS,  4-91d 

+++ 

+  + 

GST 

KOXl-KRAB(AB)GST,  1-90 

+ 

++ 

KOXl-KRAB(DV)GST,  1-90 

+ 

- 

KOXl-KRAB(A)GST,  2-59 

+ 

- 

SZF1-KRAB(AB)GST,  3-110 

+ 

++ 

AJ18-KRAB(Ab)LGST,  4-216 

+ 

ND 

AJ18-KRAB(Ab)GST,  4-91c'd 

(+++) 

(++) 

AJ18-KRAB(A)GST,  4-58 

+++ 

++ 

MTZl-KRAB(Ab)GST,  2-81 

+ 

+  + 

MTZ1-KRAB(A)GST,  2-50 

+ 

+  + 

MZF13-KRAB(AC)GST,  2-70 

+ 

++ 

MZF13-KRAB(A)GST,  2-49 

+ 

++ 

ZNF202-SCAN-KRAB(A)GST,  37-286 

+ 

- 

ZNF274-KRAB(AB)-SCANGST,  4-246c 

(+++) 

(+) 

ZNF274-KRAB(AB)GST,  4-77"* 

(+++) 

(++) 

6Histidine  baculovirus 

ZNE140-KRAB(AB)HIS,  1-119 

- 

ND 

RBCC  domain 

6Histidine 

KAP-1  -RBCCHIS,  22-418 

++ 

+  + 

KAP-1  -RBCCHIS,  50-418 

++ 

++ 

KAP-l-RBCCHIS,  TEV,  50-418 

++ 

++ 

KAP-1-B1B2CCHIS,  136-418 

- 

- 

KAP-1  -B2CCHIS,  199^18 

+ 

- 

6Histidine  baculovirus 

KAP-1  -RBCCHIS,  22-418 

++ 

++ 

pRMl  and  6histidine 

KOXl-KRAB(AB),  l-90c 

+++ 

+  + 

KAP-1 -RBCCHIS,  22-418 

+++ 

+  + 

KOXl-KRAB(AB),  12-76c 

+++ 

++ 

KAP-1 -RBCCHIS,  22-418 

+++ 

+  + 

KOXl-KRAB(AB),  12-90c 

+++ 

+  + 

KAP-1  -RBCCHIS,  50-418 

+++ 

++ 

KOXl-KRAB(AB),  12-65c 

+++ 

+  + 

KAP-1 -RBCCHIS,  50-418 

+++ 

+  + 

KOXl-KRAB(AB),  12-90c 

+++ 

++ 

KAP-l-RBCCHIS,  TEV,  50-418 

+++ 

+  + 

AJ18-KRAB(Ab),  4-91 

+++ 

+  + 

KAP-l-RBCCHIS,  50-418 

+++ 

+  + 

6  Histidine  baculovirus 

ZNE140-KRAB(AB)HIS,  1-119 

- 

ND 

KAP-l-RBCCHIS,  22-418 

++ 

ND 

ND,  not  determined.  Parentheses  indicate  that  the  protein  was  tested  after  cleavage  of  the  GST  tag. 

a  Solubility  indicates  the  initial  solubility  of  the  expressed  proteins.  A-  means  that  the  proteins 

are  not  soluble  under  native 

purification  conditions.  These  proteins  were  purified  under  denaturing  conditions  and 

were  refolded  by  either  step  dialysis  or  on  a 

column  for  functional  studies.  A+  indicates  that  the  proteins  are  soluble  under  native  purification  conditions. 

KAP-1  binding  indicates  the  KRAB  domain  proteins  were  tested  by  GST-association  assays. 

competition  assays,  transient 

transfection  assays  for  repression. 

c  The  proteins  or  protein  complexes 

were  set  for  crystallization  screen  using  screen  kits  including  (1)  Crystal  Screen,  (2)  Crystal  Screen 

2,  (3)  Wizard  I,  (4)  Wizard  II,  and  (5)  Index  at  temperature  25  °C  and  4  °C. 

d  The  proteins  were  studied  by  NMR  spectroscopy. 

The  KRAB(AB),  KRAB(Ab),  KRAB(AC)  domains 
form  a  complex  with  the  RBCC  domain  of  KAP-1 

We  reconstituted  the  KOXl-KRAB(AB):KAP-l- 
RBCC  protein  complex  using  recombinant  proteins 
in  vitro.11,12  The  yeast  two-hybrid  assay  indicates 
that  the  KRAB(AB),  KRAB  (Ab)  and  KRAB  (AC) 


interact  with  KAP-1  in  vivo,  and  that  the  B,  b  and  C 
boxes  influence  this  interaction  differently.44  To 
define  which  of  the  A,  B,  b,  and  C  boxes  of  KRAB 
are  integral  to  its  interaction  with  the  KAP-1 -RBCC, 
we  prepared  a  series  of  B,  b,  or  C  box  deletions  in 
each  of  the  KRAB  domains  in  GST-fusion  format 
(Figure  1;  Table  1).  The  GST-fusion  proteins  were 
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Figure  3.  (a)  Gel  filtration  analyses  of  KOXl-KRAB(AB),  AJ18-KRAB(Ab),  and  ZNF274-KRAB(AB)-SCAN  proteins. 
The  proteins  were  resolved  on  a  Superdex  200  column.  Fractions  were  collected  and  proteins  were  analyzed  by  SDS- 
PAGE.  The  elution  profiles  of  KOXl-KRAB(AB)  protein  (I),  AJ 1 8-KR AB ( Ab )  protein  (II),  and  ZNF274-KRAB(AB)-SCAN 
protein  (III)  on  the  gel  filtration  column,  (b)  Representation  of  analytical  ultracentrifugation  analysis  of  AJ18-KRAB(Ab). 
Sedimentation  equilibrium  analyses  were  performed  with  three  different  loading  concentrations  in  separate  cells  at 
38,900  rpm  and  55,000  rpm  at  4  °C.  The  circles  in  the  main  panel  (IV)  show  the  concentration  versus  radius  data  for  the 
three  cells  at  equilibrium.  The  three  data  sets  were  fitted  with  a  model  describing  monomers.  I— III  shows  the  residuals  for 
the  data  points  to  the  fitted  curves  at  the  three  protein  concentrations  displayed  from  highest  concentration  (top)  to  lowest 
concentration,  (c)  Representation  of  analytical  ultracentrifugation  analysis  of  ZNF274-KRAB(AB)-SCAN.  The  circles  in 
the  main  panel  (IV)  show  the  concentration  versus  radius  data  for  the  three  cells  at  equilibrium.  The  three  data  sets  were 
fitted  with  a  model  describing  dimers.  I— III  shows  the  residuals  for  the  data  points  to  the  fitted  curves  at  the  three  protein 
concentration  displayed  from  highest  concentration  (top)  to  lowest  concentration. 
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expressed  and  purified  in  E.  coli,  and  analyzed  for 
binding  to  the  KAP-l-RBCC  by  GST  association 
assay  (Figure  4(a)).  These  studies  reveal  that  dele¬ 
tion  of  the  B  box  in  KRAB(AB)  exhibits  greatly 
diminished  binding  to  KAP-l-RBCC.  A  similar 
result  was  obtained  from  the  KRAB-O  protein  in 
which  the  deletion  of  B  box  significantly  reduced 


KAP-l-RBCC  association  (data  not  shown).  The 
deletion  of  B  box  has  the  same  effect  as  the  KRAB 
(DV-AA)  mutant,  which  has  been  shown  to  abo¬ 
lish  KAP-1  binding  and  repression  function.10 
This  result  suggests  that  the  B  box  is  necessary 
for  KRAB(AB)  binding  to  the  RBCC  domain  and 
is  consistent  with  previously  published  in  vivo 
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Figure  4.  (a)  The  binding  of  the  KAP-l-RBCC  to  KRAB(AB),  KRAB(Ab),  KRAB(AC)  but  not  mutant  form  as  detected 
by  GST  association  assay.  The  input  lanes  represent  5  gg  of  purified  KAP-l-RBCC  protein  that  was  added  to  each  binding 
reaction  mixture.  No  binding  was  detected  for  the  GST-KOXl-KRAB(A)  and  GST-KOXl-KRAB(DV-AA).  Deletion  of  the 
B  box  of  KOX1-KRAB  has  the  same  effect  as  the  KRAB(DV-AA)  mutant,  (b)  Competition  of  the  GST-KOXl-KRAB(AB): 
KAP-l-RBCC  complex  formation  by  the  purified  KOXl-KRAB(AB),  AJ18-KRAB(Ab),  and  ZNF274-KRAB(AB)-SCAN 
proteins.  A  constant  amount  of  GST-KOXl-KRAB(AB)  (5  gg)  was  mixed  with  increasing  amounts  of  one  of  the 
competitors  (1  gg,  5  gg,  and  10  gg),  and  then  a  constant  amount  of  KAP-l-RBCC  (5  gg)  was  added.  The  reaction  was 
incubated  at  equilibrium  conditions  for  one  hour  at  room  temperature.  AJ18-KRAB(Ab)  and  ZNF274-KRAB(AB)-SCAN 
are  very  efficient  competitors  of  the  GST-KOXl-KRAB(AB):KAP-l-RBCC  interaction. 
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studies.44  However,  deletion  of  the  b  box  or  C 
box  in  KRAB(Ab)  or  KRAB(AC)  had  no  effect  on 
their  binding  to  the  RBCC  domain,  indicating  that 
the  A  box  in  the  context  of  KRAB(Ab)  and  KRAB 
(AC)  is  sufficient  to  bind  the  RBCC  domain  in 
vitro  and  is  consistent  with  previously  published 
in  vivo  studies.44  Together,  the  above  data  indi¬ 
cate  that  the  A  box  in  KRAB(AB)  might  fold  dif¬ 
ferently  in  KRAB(Ab)  or  KRAB(AC);  and  that  the 
B  box  in  KRAB(AB)  is  needed  for  recognition  of 
the  RBCC  domain,  possibly  for  proper  folding  of 
A  box  of  KRAB(AB). 

To  determine  whether  the  KRAB(AB)  and  KRAB 
(Ab)  domain  have  the  same  binding  affinity  to  the 
RBCC  domain,  we  performed  a  competition  assay 
under  equilibrium  conditions.  The  GST-KOX1- 
KRAB(AB)  was  first  immobilized  on  GST  resin, 
and  increasing  amounts  of  competitors  including 
KOXl-KRAB(AB),  ZNF274-KRAB(AB)-SCAN,  and 
AJ18-KRAB(Ab)  were  added  to  the  resin,  followed 
by  addition  of  a  constant  amount  of  KAP-l-RBCC 
protein.  The  reaction  was  then  incubated  under 
equilibrium  conditions.  The  results  show  that 
ZNF274-KRAB(AB)-SCAN  and  AJ 1 8-KR AB ( Ab ) 
are  very  efficient  and  comparable  competitors  for 
the  GST-KOXl-KRAB(AB):KAP-l-RBCC  interac¬ 
tion  (Figure  4(b)). 

Stoichiometry  of  the  RBCC  domain  of  KAP-1  to 
zinc  is  1:4 

We  previously  defined  the  RBCC  region  to  be 
sufficient  and  necessary  for  oligomerization  of 
KAP-1  and  KRAB  domain  binding  using  recombi¬ 
nant  proteins  in  vitro,12  although  two  B  boxes  and 
the  coiled-coil  region  were  shown  to  be  sufficient 
for  interaction  with  the  KRAB  domain  using  a  in 
vivo  mammalian  two-hybrid  assay.47  Obviously, 
the  KAP-l-RBCC  mediates  a  highly  specific,  direct 
interaction  with  KRAB  domain,  and  it  functions  as 
an  integrated,  cooperative  structural  unit  wherein 
each  sub-domain  contributes  to  oligomerization 
and  ligand  recognition.  The  RING  finger  is  a 
cysteine-rich  motif  of  the  form  C3HC4  and  binds 
two  molecules  of  zinc  in  a  unique  cross-braced 
ligation  system.48-50  The  RING  finger  is  likely 
to  contribute  either  to  specificity  and  /  or  multi- 
merization  properties  of  the  RBCC  motif.  RING 
finger  structures  suggest  that  a  common  hydro- 
phobic  core  is  formed  as  a  result  of  zinc-chelation, 
and  that  the  variable  sequence  between  the 
conserved  ligation  residues  provides  specificity 
for  the  protein  recognition.  The  B-box  is  also  a 
cysteine-rich  zinc  binding  motif  in  the  form  of 
CHC3H2.51  The  NMR  structure  of  the  XNF7  B-box 
shows  that  only  one  zinc  atom  is  bound  with  the 
other  potential  chelation  residues  unoccupied.  The 
coiled-coil  motif  is  comprised  of  appropriately 
spaced  hydrophobic  residues  predicted  to  form  an 
extended  a-helical  region.  The  coiled-coil  region  of 
KAP-1  is  predicted  to  fold  into  two  potential 
leucine  zipper-like  motifs,12  52  where  each  region 
is  likely  to  be  an  amphipathic  a-helix. 


The  conservation  of  eight  potential  metal  ligands 
in  the  RING  finger  and  B-box  in  the  RBCC  domain 
suggests  that  it  is  a  zinc-binding  protein.  Mutations 
of  these  zinc-chelating  cysteine  and  histidine  inter¬ 
rupt  the  folding  of  the  molecule  and  abolish  KRAB 
domain  binding.12  To  determine  the  identity  and 
stoichiometry  of  metal  bound  by  the  RBCC  domain, 
we  used  inductivity  coupled  plasma  (ICP)  spectro¬ 
metry.  These  measurements  indicate  that  330  gM 
KAP-l-RBCC  protein  contains  1.  54  mM  zinc,  and 
33  gM  KAP-l-RBCC  protein  contains  143  gM  zinc. 
The  ratio  of  one  protein  molecule  to  zinc  atoms  is 
1:4.6  and  1:4.29,  respectively,  indicating  that  RBCC 
domain  binds  four  zinc  atoms  per  protein  molecule. 
The  negative  control  of  His-tagged  protein  shows 
very  low  background  binding  to  zinc  atoms  (data 
not  shown).  Potentially,  the  RBCC  domain  utilizes 
all  eight  conserved  cysteine  and  histidine  residues 
from  the  RING  finger  for  metal  ligation.  Only  four 
conserved  cysteine  and  histidine  residues  from  the 
B1  and  B2  boxes  are  used  for  metal  chelation.  This 
result  is  consistent  with  observations  from  the  NMR 
structure  of  the  XNF7  B-box.53  Together,  the  muta¬ 
tion  study12  and  zinc  content  analyses  indicate  that 
the  RBCC  domain  requires  zinc  for  folding. 

Reconstitution  of  KRAB: KAP-1 -RBCC  protein 
complex  in  vivo  using  a  two-plasmid  expression 
system 

We  previously  mapped  the  interaction  domains 
for  both  KOXl-KRAB(AB)  and  KAP-l-RBCC  pro¬ 
teins  in  vivo  and  in  vitro  and  reconstituted  the 
protein  complex  using  these  highly  purified  pro¬ 
teins.  However,  the  efficiency  of  forming  this  protein 
complex  is  not  high  enough  to  allow  us  to  do 
biophysical  and  structural  studies.  Therefore,  we 
developed  a  two-plasmid  expression  system  to  co¬ 
express  and  co-purify  the  protein  complex.  This 
system  increases  protein  expression,  solubility, 
efficiency  of  protein  complex  formation,  and  over¬ 
all  complex  stability.  As  a  result,  we  successfully 
obtained  a  highly  purified  and  stable  protein 
complex  (Figure  5(a)).  The  stoichiometry  of  KRAB: 
KAP-l-RBCC  was  then  determined  by  sedimenta¬ 
tion  equilibrium  analysis.  The  datum  followed  the 
model  predicting  a  1:3  molar  ratio  of  KRAB(AB): 
KAP-l-RBCC  complex  (79%,  81%,  and  92%  in  three 
protein  concentrations,  respectively)  and  a  small 
amount  of  aggregate  (21%,  19%,  and  8%,  respec¬ 
tively)  at  a  speed  of  10,000  rpm  (Figure  5(b)).  This 
result  was  consistent  with  previous  observations 
from  densitometry  and  kinetic  studies  of  the  KRAB 
(AB):KAP-1-RBCC  protein  complex.11 

The  entire  RBCC  domain  of  KAP-1  and  the  AB 
box  of  KRAB  forms  a  core  complex 

To  determine  the  core  complex  of  the  KRAB:KAP- 
1-RBCC  proteins,  we  employed  limited  proteolysis 
experiment,  where  the  core  would  be  protected  from 
the  protease  digestion.  Proteases  including  trypsin, 
proteinase  K,  subtilisin,  and  thermolysin  were  used 
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Figure  5.  (a)  Gel  filtration  analysis  of  the  KOXl-KRAB(AB):KAP-l-RBCC  protein  complex.  The  reconstitution  of 
KOXl-KRAB(AB):KAP-l-RBCC  protein  complex  in  vivo  is  described  in  Materials  and  Methods.  The  protein  complex  was 
resolved  on  a  Superdex  200  column.  The  fractions  were  collected,  and  the  proteins  were  analyzed  by  SDS-PAGE.  The 
peak  fraction  of  the  complex  (fraction  11)  eluted  with  a  molecular  mass  of  ~  158  kDa.  (b)  Analytical  ultracentrifugation 
analysis  of  KOXl-KRAB(AB):KAP-l-RBCC  protein  complex.  Sedimentation  equilibrium  analysis  was  performed  at 
10,000  rpm  and  4  °C.  Panel  IV  shows  the  concentration  versus  radius  data  for  three  loading  concentration  of  the  protein 
complex  (circles).  The  three  data  sets  were  fitted  globally  with  a  model  describing  a  predominant  complex  with  1:3  molar 
ratio  of  KOXl-KRAB(AB):KAP-l-RBCC  and  small  amount  of  aggregate.  I— III  shows  the  residuals  for  the  data  points  to 
the  fitted  curves  at  the  three  protein  concentrations  displayed  from  highest  concentration  (top)  to  lowest  concentration,  (c) 
Trypsin  limited  proteolysis.  Mass  spectrometry,  and  N-terminal  sequence  analyses  for  KOX1-KRAB(AB)1-90:KAP- 
RBCCHIS,22^18  protein  complex.  The  protein  complex  was  treated  with  trypsin  (0,  lane  1;  0.04  jj.g/jj.1,  lane  2;  and 
0.008  jug/ pi,  lane  3).  Aliquots  of  each  reaction  were  analyzed  by  SDS-PAGE  and  mass  spectrometry.  Bands  A,  B,  C,  and  D 
were  subjected  N-terminal  sequence  analysis,  (d)  A  diagram  illustrating  the  products  after  the  trypsin  treatment.  Two 
major  fragments  were  produced  from  KOX1-KRAB1-90  and  three  fragments  were  yielded  from  the  KAP-l-RBCCHIS, 
22-418.  The  numbers  represent  amino  acid  position. 
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for  limited  proteolysis  on  the  highly  purified  KOX1- 
KRAB(AB)1-90:KAP-1-RBCCHIS,  22-418  protein 
complex.  Consistent  digested  bands  were  produced 
by  these  proteases  for  the  protein  complex  (data  not 
shown).  The  unbound  KAP-l-RBCCHIS,  22-418 
protein  and  KOX1-KRABHIS,1-90  protein  were 
also  subjected  to  limited  proteolysis.  No  significant 
change  in  the  digestion  pattern  was  observed  by 
SDS-PAGE  for  the  unbound  KAP-l-RBCCHIS,  22- 
418  compared  to  its  pattern  in  the  protein  com¬ 
plex  (data  not  shown).  Conversely,  the  unbound 
KOX1-KRABHIS,1-90  protein  was  digested  to  a 
series  small  fragments  while  only  partial  digestion 
occurred  in  its  complexed  counterpart  (Figure  5(c); 
data  not  shown).  These  data  indicate  that  only  the 
KOX1-KRAB  protein  bound  to  the  RBCC  domain  of 
KAP-1  was  protected  from  protease  digestion. 
Trypsin  was  chosen  to  represent  one  of  the  limited 
proteolyses  products  from  the  protein  complex. 
Mass  spectrometry  and  N-terminal  sequence  ana¬ 
lyses  of  selected  bands  of  A,  B,  C,  and  D  were 
performed  after  limited  proteolysis  (Figure  5(c)). 
The  results  show  that  two  major  fragments  were 
produced  from  KOX1-KRAB(AB)1-90  (Figure  5(d)). 
The  first  fragment  (KOX1, 12-90)  includes  the  entire 
A  box  and  B  box.  The  second  fragment  (KOX1,  12- 
65)  contains  the  complete  A  box  and  the  N-terminal 
half  of  the  B  box.  Three  major  fragments  were 
produced  from  KAP-l-RBCCHIS,  22-418.  The  first 
fragment  (KAP-1,  33-418)  includes  the  entire  RING 
finger,  two  B  boxes,  and  coiled-coil  region;  the 
second  fragment  (KAP-1,  142-418)  contains  two  B 
boxes  and  coiled-coil  region;  and  the  third  fragment 
(KAP-1, 200-418)  includes  the  B2  box  and  coiled-coil 
region.  Guided  by  the  program  Predictprotein  two 
protein  complexes  were  made  using  the  two- 
plasmid  expression  system  (Table  1)  including  (1) 
KOXl-KRAB(AB)  12-90  and  KAP-1-RBCC50-418, 
and  (2)  KOXl-KRAB(A)12-65  and  KAP-l-RBC 
C50-418.  These  two  protein  complexes  were  highly 
expressed  and  soluble.  However,  deletion  of  the 
RING  finger  and/or  B1  box  in  RBCC  resulted  in 
unstable  and  insoluble  proteins  with  or  without  the 
co-expression  of  KOX1-KRAB(AB)12-90.  Taken 
together,  these  data  suggest  that  the  core  complex 
may  be  comprised  of  KOX1-KRAB(AB)12-90  and 
the  entire  RBCC  motif  of  KAP-1. 

Stoichiometry  of  the  KRAB(Ab):RBCC  complex 
is  1:3 

Although  it  has  been  demonstrated  that  one 
molecule  of  KRAB(AB)  associates  with  three  mole¬ 
cules  of  KAP-l-RBCC,  it  has  not  been  determined 
whether  this  stoichiometry  exists  for  KRAB(Ab).  To 
determine  the  ratio  of  AJ18-KRAB(Ab)  to  KAP-l- 
RBCC,  the  two  purified  proteins  were  mixed 
together,  and  then  the  complex  was  isolated  by  gel 
filtration  (Figure  6(a)).  The  resulting  KRAB(Ab): 
KAP-l-RBCC  protein  complex  was  analyzed  using 
sedimentation  equilibrium.  The  data  were  best 
described  by  a  model  containing  a  complex  in  1:3 
molar  ratio  of  AJ18-KRAB(Ab):KAP-l-RBCC  (81%, 


85%,  and  94%  in  three  protein  concentrations, 
respectively),  and  small  amount  of  aggregate  (19%, 
15%,  and  6%,  respectively)  at  speed  of  10,000  rpm 
(Figure  6(b)).  This  result  indicates  that  one  molecule 
of  AJ18-KRAB(Ab)  directly  interacts  with  three 
molecules  of  KAP-l-RBCC.  The  AJ18-KRAB(Ab) 
use  the  same  mode  as  KOX-KRAB(AB)  to  bind  to 
the  KAP-l-RBCC. 

Next,  to  determine  if  the  secondary  structure  of 
the  KRAB(Ab)  is  changed  upon  the  binding  of  the 
KAP-l-RBCC,  AJ18-KRAB(Ab)  protein  was  titrated 
with  the  RBCC  protein  and  analyzed  by  CD  spectra. 
For  comparison,  free  and  bound  KRAB(Ab)  spectra 
are  overlaid  (Figure  7).  The  CD  spectra  show  little 
difference  between  the  free  and  bound  KRAB(Ab). 
The  data  suggest  that  only  a  subtle  change  in 
secondary  structure  of  the  KRAB  domain  occurs 
upon  the  binding  of  the  RBCC  domain.  We 
entertained  the  possibility  that  more  sensitive  and 
dynamic  techniques  may  be  necessary  to  monitor 
any  conformation  change  of  the  KRAB  domain  that 
powerful  tools  for  characterizing  the  folded  state  of 
occurs  upon  binding  of  the  KAP-l-RBCC. 

Potential  induced  structure  of  the  KRAB  domain 
by  the  RBCC  domain  of  KAP-1 

NMR  and  CD  spectroscopies  are  powerful  tools 
for  characterizing  the  folded  state  of  proteins  in 
solution.  Typically,  a  well-folded  protein  exhibits 
large  1 H  chemical  shift  dispersion  and  a  single  set  of 
resonances,  for  example,  in  the  2D  lH-15N  hetero- 
nuclear  single  quantum  coherence  (HSQC)  experi¬ 
ment.  Figure  8  shows  the  HSQC  spectra  of  His-AJ18 
KRAB(Ab)  (Figure  8(a)  and  (b))  and  ZNF274-KRAB 
(AB)  (Figure  8(d)).  In  general,  the  majority  of 
backbone  amide  resonances  exhibit  poor  dispersion, 
indicating  both  constructs  have  substantial  regions 
of  disordered  structure  and  do  not  fold  into  a  single 
globular  protein  conformation,  consistent  with  CD 
results.  This  is  particularly  true  for  the  ZNF274- 
KRAB(AB)  construct,  where  almost  all  backbone 
resonances  fall  (>95%)  within  the  center  of  the 
spectrum  (7.7  ppm  to  8.5  ppm).  Some  resonances  are 
broadened  or  missing  (~  50  out  of  75  backbone 
resonances  can  be  readily  detected)  indicating 
intermediate  timescale  motions.  This  might  reflect 
exchange  between  equally  unfolded  conformations 
and/or  the  presence  of  some  transient  local  struc¬ 
ture.  It  is  important  to  note  that  unfolded  proteins 
can  contain  nascent  structure.  The  presence  of 
nascent  structure  is  consistent  with  the  CD  studies 
and  the  presence  of  some  resonances  outside  the 
"unstructured  envelope"  of  the  HSQC  spectrum. 
The  His-AJ18-KRAB(Ab)  construct,  displays  greater 
spectral  heterogeneity,  some  of  which  is  due  to  the 
presence  of  the  His-tag  (see  below).  Nevertheless,  a 
core  of  sharp  and  broad  resonances  are  observed  for 
a  narrow  chemical  shift  range  (~80  resonances 
between  7.9  ppm  and  8.8  ppm  in  Figure  8(a)  and 
(b)),  consistent  with  a  mostly  disordered  configura¬ 
tion.  At  very  low  contour  levels,  a  number  of  very 
broad  /  weak  resonances  are  observed,  indicating  the 
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presence  of  exchange  between  unfolded  conformers 
characterized  by  some  nascent  structure  (Figure 
8(b)).  Importantly,  2D  nuclear  Overhauser  enhance¬ 
ment  spectroscopy  (NOESY)  of  both  these  samples 
exhibited  minimal  NOE  cross-peaks  (data  not 
shown)  providing  further  evidence  of  lack  of  a  well 


defined  3D  conformation.  The  ID  'H  spectra  of  AJ18- 
KRAB(Ab),  His-AJ18-KRAB(Ab)  and  ZNF274- 
KRAB(AB)  (Figure  9(a),  (b)  and  (c))  further  confirm 
the  lack  of  structure  for  the  KRAB  fragments.  Note 
the  absence  of  resonances  below  0  ppm,  indicating 
absence  of  ring  current  shifted  methyl  resonances  (in 
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Figure  7.  The  CD  spectra  of  the  free  and  bound  AJ18- 
KRAB(Ab)  protein.  CD  spectra  were  carried  out  in  a  buffer 
containing  20  mM  Hepes  (pH  7.5),  100  mM  NaCl,  1  mM 
TCEP  at  a  protein  concentration  of  0.1  mg/ml  (AJ18- 
KRAB(Ab)),  and  0.6  mg/ ml  (KAP-l-RBCC),  respectively. 
A  CD  spectrum  for  a  AJ18-KRAB(Ab)  protein  containing 
65%  random  coil  and  35%  helix  is  plotted  (blue).  A  CD 
spectrum  for  a  AJ18-KRAB(Ab)  protein  titrated  with 
KAP-l-RBCC  protein  containing  65%  random  coil  and 
35%  helix  is  plotted  (red). 


these  constructs,  there  are  18/19  Ile,Val  or  Leu 
residues  and  8/7  aromatic  residues).  Significantly, 
removal  of  the  12  residue  hexa  His-tag  (compare 
Figure  9(a)  and  (b)),  greatly  reduces  the  1H  chemical 
shift  dispersion  for  the  AJ18-KRAB(Ab)  construct 
and  sharpens  the  spectrum,  indicating  a  reduction  in 
any  nascent  structure.  This  is  concomitant  with  a  CD 
results,  which  show  a  decrease  in  helical  structure 
content  upon  removal  of  the  His-tag.  Consistent  with 
these  studies,  the  proteolysis  of  the  free  KRAB 
constructs,  clearly  indicates  that  any  such  structure 
is  transient  as  it  does  not  lead  to  protection  from 
proteases.  Overall,  the  qualitative  analysis  of  the 
NMR  spectra,  coupled  with  the  proteolysis  and  CD 
results,  indicate  that  the  KRAB  fragments,  in  their 
active  form,  do  not  exhibit  properties  normally 
associated  with  folded  globular  proteins.  These 
data  strongly  suggest  that  the  polypeptide  is  largely 
disordered,  although  there  is  evidence  of  limited 
structure  that  may  be  transiently  populated  and 
critical  for  activity. 

To  assess  whether  the  KRAB  domain  folds  into  an 
ordered  structure  upon  the  formation  of  a  complex 


with  KAP-1,  experiments  were  carried  out  using  15N 
labeled  His-AJ18-KRAB(Ab)  and  un labeled  KAP-l- 
RBCC  proteins.  To  ensure  that  the  KRAB(Ab) 
protein  was  indeed  in  a  complex,  the  proteins 
were  purified  separately,  incubated  together  and 
then  complexes  of  the  appropriate  size  were  isolated 
by  gel  filtration  (Figure  6(a)).  Free  and  bound  KRAB 
(Ab)  fractions  were  then  concentrated  and  analyzed 
with  NMR.  The  1H-15N  HSQC  spectra  of  the  free 
KRAB(Ab)  and  the  complex  are  shown  in  Figure  8(a) 
and  (c).  Only  signals  from  the  KRAB(Ab),  but  not 
from  KAP-l-RBCC,  were  observed  since  unlabeled 
KAP-l-RBCC  was  used.  For  the  KRAB(Ab)  in  the 
complex,  there  are  fewer  peaks  present  than  evident 
in  the  free  spectrum.  The  loss  of  signals  is  consistent 
with  KRAB(Ab)  binding  to  the  much  larger  KAP-l- 
RBCC  protein.  Given  that  the  KRAB(Ab):KAP-l- 
RBCC  complex  is  1:3  molar  ratio,  the  complex  is  a 
180  kDa  species.  At  this  molecular  mass,  molecular 
tumbling  is  slow  therefore  the  NMR  signals  are  too 
broad  to  be  readily  detected.  Thus,  the  loss  of 
observable  peaks  in  the  complex  versus  in  the  free 
spectra  indicates  that  KRAB(Ab)  was  binding  to 
KAP-l-RBCC.  Clearly,  there  are  some  peaks 
remaining  in  the  complex  spectra.  However,  the 
peaks  that  are  still  present  likely  represent  mobile 
NH2  and  COOH  termini  of  KRAB(Ab)  that  are  not 
making  direct  contacts  to  the  KAP-l-RBCC.  These 
data  are  consistent  with  the  limited  proteolysis  and 
mass  spectrometry  analysis,  for  example  these 
predicted  three  Gly  residues  to  be  visible  in  the 
NMR  spectrum,  which  is  observed.  Results  from 
total  correlated  spectroscopy  (TOCSY)  experiments 
were  also  consistent  with  both  the  NH2  and  COOH 
termini  being  flexible  (data  not  shown).  Without 
further  analysis,  which  is  beyond  the  scope  of  the 
present  study,  it  is  not  possible  to  definitively  assess 
the  conformational  state  of  the  KRAB  in  complex 
with  KAP-l-RBCC.  However,  the  results  of  the 
proteolysis  of  the  KRAB:KAP-1  complex,  suggest 
that  KRAB  is  well  folded  upon  binding  KAP-1.  The 
presence  of  conformational  exchange  or  mobile 
regions  outside  the  N  and  C  termini  of  KRAB 
would  lead  to  a  greater  degree  of  digestion.  In 
addition,  the  spectrum  of  KRAB  in  complex  with 
KAP-l-RBCC  exhibits  no  heterogeneity,  which 
would  be  expected  if  some  chemical  exchange 
processes  were  present.  Finally,  some  peaks  were 
found  to  undergo  chemical  shift  perturbation,  which 
may  result  from  a  different  chemical  environment 
existing  near  the  complex  interface  upon  complex 


Figure  6.  (a)  Gel  filtration  analysis  of  the  AJ18-KRAB(Ab):KAP-l-RBCC  protein  complex.  The  reconstitution  of  AJ18- 
KRAB(Ab):KAP-l-RBCC  protein  complex  in  vitro  is  described  in  Materials  and  Methods.  Individual  proteins  were 
purified  separately  and  then  mixed  together.  The  mixture  was  subjected  to  gel  filtration  to  separate  unbound  AJ18-KRAB 
(Ab)  protein.  The  tractions  were  collected  and  the  proteins  were  analyzed  by  SDS-PAGE.  The  peak  fraction  of  the  complex 
(fraction  26)  eluted  with  a  molecular  mass  of  ~  158  kDa.  Significant  amounts  of  AJ18-KRAB(Ab)  was  bound  to  the  KAP- 
l-RBCC  protein,  (b)  Analytical  ultracentrifugation  analysis  of  AJ18-KRAB(Ab):KAP-l-RBCC  protein  complex. 
Sedimentation  equilibrium  analysis  was  performed  at  10,000  rpm  and  4  °C.  Panel  IV  shows  the  concentration  versus 
radius  data  for  three  loading  concentration  of  the  protein  complex  (circles).  The  three  data  set  were  fitted  globally  with  a 
model  describing  a  predominant  complex  with  1:3  molar  ratio  of  AJ18-KRAB(Ab):KAP-l-RBCC  and  small  amount  of 
aggregate.  I— III  shows  the  residuals  for  the  data  points  of  the  fitted  curves  at  the  three  protein  concentrations  displayed 
from  highest  concentration  (top)  to  lowest  concentration. 
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(c)  His-AJ18-KRAB(Ab):KAP-l-RBCC  complex,  and  (d)  ZNF274-KRAB( AB) . 
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formation,  and/or  due  to  folding.  These  data,  taken 
together  with  the  limited  proteolysis  data,  strongly 
suggest  that  addition  of  the  KAP-l-RBCC  leads  to 
the  folding  of  KRAB  domain. 


Discussion 

Here,  we  systematically  evaluated  the  oligomeric 
states  of  KRAB  domains  from  KRAB-ZNPs  alone  or 
bound  in  a  complex  with  KAP-l-RBCC.  We  also 
determined  their  binding  affinity  and  the  stoichio¬ 
metry  of  KRAB:KAP-1-RBCC  association.  We  con¬ 
clude  the  following  from  our  data.  (1)  The  KRAB 
(AB)  and  KRAB(Ab)  domains  exist  as  a  monomer 
either  alone  or  in  a  complex  with  the  RBCC  domain 


of  KAP-1,  while  the  KRAB(AB)-SCAN  domain 
exists  as  a  dimer.  (2)  The  interaction  between  a 
KRAB  and  KAP-l-RBCC  is  direct  and  specific,  and 
it  requires  only  A  box  in  KRAB(Ab)  or  KRAB(AC) 
but  both  A  and  B  boxes  in  KRAB(AB).  (3)  The  KRAB 
(Ab)  has  an  equivalent  binding  capacity  to  KAP-l- 
RBCC  compared  to  the  KRAB(AB).  (4)  A  stable 
complex  can  be  formed  between  the  KRAB  and 
KAP-l-RBCC  proteins  in  vivo  and  in  vitro,  and  one 
molecule  of  KRAB  directly  interacts  with  three 
molecules  of  KAP-l-RBCC.  (5)  Limited  proteolysis, 
mass  spectrometry  and  N-terminal  sequence  ana¬ 
lyses  indicate  that  the  core  complex  is  formed  by 
the  entire  RBCC  domain  of  KAP-1  and  the  AB  box 
of  the  KRAB.  (6)  NMR  spectroscopy  indicates  that 
unbound  KRAB  domain  is  structurally  disordered 
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Figure  9.  ID  :H  spectra  of  (a) 
AJ18-KRAB(Ab),  (b)  His-AJ18- 
KRAB(Ab),  and  (c)  ZNF274-KRAB 
(AB). 


in  solution  and  actively  binds  its  target,  the  KAP-1  - 
RBCC  protein. 

Previous  studies  using  purified  recombinant 
proteins  indicated  that  the  RBCC  domain  of  KAP-1 
is  necessary  and  sufficient  for  interaction  with 
KOXl-KRAB(AB),  and  this  interaction  is  direct 
and  specific.1  12  We  now  demonstrate  that  direct 
interaction  also  exists  between  the  KRAB(Ab), 
KRAB(AC),  KRAB(AB)-SCAN  and  the  KAP-1- 
RBCC,  which  is  consistent  with  the  observation 
from  the  yeast  two-hybrid  analysis  of  the  interaction 
between  different  KRAB  domains  of  the  KRAB- 
ZFPs  and  the  TIF1  protein  family.  This  interaction  is 
stable  when  subjected  to  in  vitro  biochemical 
manipulation  as  detected  by  GST  association  assay, 
gel  filtration,  and  ultracentrifugation.  Moreover,  the 
binding  affinity  of  KRAB(Ab)  to  RBCC  is  compar¬ 
able  to  that  of  KRAB(AB)  as  determined  by  the 
competition  assay.  These  data  provide  biochemical 
evidence  in  support  of  the  proposed  model  that 
KAP-1  serves  as  a  co-repressor  for  the  whole  KRAB- 


ZFP  family.  However,  it  remains  unclear  why  the  B 
box  in  the  context  of  KRAB(AB)  is  necessary  for 
binding  to  the  RBCC,  while  the  b  or  C  box  in  the 
context  of  KRAB(Ab)  or  KRAB(AC)  is  not. 

Although  it  has  been  demonstrated  that  the  KRAB 
domain  binding  induces  and  /  or  stabilizes  the  trimer 
of  the  KAP-l-RBCC  protein  and  that  the  KOX1- 
KRAB  domain  functions  as  a  monomer  to  bind  to 
the  homotrimer  of  the  KAP-l-RBCC  protein,1112 
little  is  known  about  its  structure  and  the  protein- 
protein  interface  of  this  complex.  Here,  we  demon¬ 
strate  for  the  first  time  using  sedimentation  equili¬ 
brium  that  one  molecule  of  KRAB  directly  interacts 
with  three  molecules  of  KAP-l-RBCC  in  a  stable 
highly  purified  protein  complex.  This  datum  is 
consistent  with  previous  findings  using  an  optical 
biosensor  assay  and  quantitation  analysis  by  den¬ 
sitometry  in  vitro.11  Moreover,  we  confirmed  by 
limited  proteolysis,  mass  spectrometry  and  N- 
terminal  sequence  analyses  that  both  proteins  form 
a  core  complex,  which  is  composed  of  the  entire 
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RBCC  domain  of  KAP-1  and  AB  box  of  the  KRAB 
domain.  This  result  is  consistent  with  the  previous 
observations  that  the  RBCC  domain  functions  as  an 
integrated  and  cooperative  structural  unit  wherein 
each  sub-domain  contributes  to  oligomerization 
and/or  ligand  recognition. 1_ 

The  most  important  finding  from  this  study  is  that 
the  unbound  KRAB  domain  is  structurally  disor¬ 
dered  in  solution  and  actively  binds  its  target,  the 
KAP-l-RBCC  protein.  Initially,  we  chose  to  use  the 
KOX1-KRAB  in  a  complex  with  the  KAP-l-RBCC 
for  screening  crystal  formation,  since  both  proteins 
can  be  co-expressed  and  co-purified  using  a  two- 
plasmid  system  to  form  a  stable  complex.  However, 
no  crystal  formed  from  these  protein  complexes.  We 
also  screened  other  KRAB  domains  from  the  KRAB 
(Ab)-ZFP  and  KRAB(AB)-SCAN-ZFP  subfamilies 
for  crystal  formation,  since  these  proteins  are  highly 
soluble  and  stable  in  solution.  Again,  these  KRAB 
domains  failed  to  form  crystals.  Therefore,  we 
applied  NMR  techniques  to  elucidate  the  structural 
and  dynamic  characteristics  of  the  KRAB  domain 
molecules.  NMR  has  been  successfully  utilized  to 
determine  the  functional  roles  and  structures  of  the 
structurally  disordered  proteins  which  have  basic 
regulatory  roles  in  key  cellular  processes  such  as 
transcriptional  regulation,  translation  and  cellular 
signal  transduction. 54-56  One  of  the  most  well 
characterized  examples  of  this  is  the  kinase-induci¬ 
ble  transcriptional-activation  domain  (KID)  of  CREB 
transcriptional  activator.  The  KID  polypeptide  is 
intrinsically  disordered,  as  both  an  isolated  peptide 
and  in  full-length  CREB.  It  folds  to  form  a  pair  of 
orthogonal  helices  upon  binding  to  its  targei  (KIX) 
domain  in  CBP  transcriptional  co-activator.  Other 
examples  of  synergistic  folding  during  the  complex 
formation  are  the  nuclear-receptor  co-activator¬ 
binding  domain  (NCBD)  of  CBP  and  the  activator 
for  thyroid  hormone  and  retinoid  receptor  domain 
(ACTR)  of  the  pl60  nuclear-receptor  co-activator 
and  the  Mad-Sin3  repressor-co-repressor  complex. 
Both  of  these  protein  partners  undergo  mutual 
folding  transitions  upon  complex  formation. 57  Simi¬ 
larly,  we  have  found  that  KRAB  domains  are  also 
structurally  disordered  and  that  KAP-1  association 
aids  in  folding  KRAB  domains. 

The  benefits  of  structurally  disordered  proteins 
have  been  elucidated.38'56  One  of  the  advantages  is 
that  unfolded  protein  domains  undergo  a  significant 
disorder-order  transition  which  facilitates  molecu¬ 
lar  recognition.  These  unfolded  proteins  remain  in 
an  open/extended  conformation  when  they  com¬ 
plex  with  their  target.  This  provides  a  disproportio¬ 
nately  large  binding  surface  and  multiple  contact 
points  for  protein-protein  interaction.  The  KRAB 
domain  is  highly  enriched  in  hydrophobic  amino 
acid  residues  and  cannot  bury  enough  of  its 
hydrophobic  surface  to  form  a  stable  globular 
structure  in  the  absence  of  a  binding  partner.  A 
molecule  of  the  KRAB  domain  (Mr  is  10  kDa)  is 
much  smaller  than  the  RBCC  domain  of  KAP-1  (Mr 
is  45.9  kDa),  which  functions  as  a  homotrimer.  Each 
sub-domain  of  the  RBCC  motif  contributes  to  KRAB 


domain  recognition.  Therefore,  it  is  very  possible 
that  the  disordered  structure  of  the  KRAB  domain 
protein  facilitates  the  assembly  of  the  KRAB:KAP-1- 
RBCC  complex  by  forming  an  extensive  intermole- 
cular  interface.  This  provides  a  high  binding  spe¬ 
cificity  and  affinity  (dissociation  constant  (Kd)  = 
~  142  nM,  a  very  fast  association  and  a  comparably 
slow  dissociation  rate11). 

Whether  the  KAP-l-RBCC  is  structurally  disor¬ 
dered  has  not  been  determined  although  it  forms  a 
homotrimer  in  unbound  or  bound  in  a  complex  with 
the  KRAB.  The  RBCC  protein  is  relatively  unstable 
and  tends  to  precipitate  in  solution  without  its 
binding  partner,  KRAB,  suggesting  that  it  might  not 
be  fully  folded  or  that  it  is  prone  to  make  bodies.58 
The  RBCC  seems  to  associate  with  the  KRAB  by  a 
mutual  induced-fit  mechanism:  both  proteins  are 
significantly  more  structured  in  a  complex  than  in  a 
free  state.  This  is  consistent  with  our  previous 
observation  that  the  KRAB  stabilizes  the  RBCC 
trimer.12  The  RBCC  domain  requires  zinc  for  folding 
as  detected  by  ICP  spectrometry.  One  molecule  of 
RBCC  binds  four  zinc  atoms.  Eight  conserved 
cysteine  and  histidine  residues  in  the  RING  finger 
and  only  four  conserved  cysteine  and  histidine 
residues  in  each  of  the  B1  and  B2  boxes  are  used  for 
zinc  ligation.  These  data  are  well  correlated  with  a 
previous  finding  that  the  mutations  of  conserved 
pairs  of  cysteine /cysteine  and  cysteine /histidine 
residues  in  RING  finger  and  B2  box  disrupt  the 
structure  of  RBCC  domain  and  concomitantly 
abolish  the  KRAB  binding.12 

Recently  a  protein  containing  only  a  KRAB 
domain  (KRAB-O)  was  identified  to  interact  with 
the  SRY  protein  using  a  yeast  two-hybrid  screen.  56 
The  KRAB-O  lacks  a  zinc  finger  region  but  contains 
a  classical  A  box  and  B  box.  It  has  been  shown  to 
interact  with  the  BRG  region  of  the  SRY  protein  in 
vivo  and  in  vitro.  Finding  new  ligands  for  the  KRAB 
domain  would  open  new  doors  in  KRAB  domain 
structural  studies,  since  the  SRY  is  much  smaller 
than  the  RBCC  domain.  Whether  the  KRAB-O 
domain  is  structured  has  yet  to  be  determined.  If  it 
is  structurally  disordered  without  binding  to  its 
target,  then  the  binding  of  SRY  might  induce  the 
structure  of  the  KRAB  domain  so  that  this  complex 
could  be  detected  by  NMR  spectroscopy. 

Taken  together,  all  data  presented  here  fur¬ 
ther  support  the  current  working  model  for  KRAB- 
ZFP-KAP-l-mediated  transcriptional  repression.  A 
KRAB-ZFP  binds  target  gene  DNA  sequence-speci- 
fically  through  its  array  of  C2H2  zinc  fingers.  The 
DNA-bound  KRAB  domain  recruits  the  KAP-1  co¬ 
repressor  via  direct  interaction  with  the  RBCC 
domain.  KAP-1  apparently  self-assembles  into  a 
homotrimer,  which  can  actively  bind  the  KRAB 
domain.  The  HP1BD  (PXVXL  motif),  PHD  finger, 
and  bromodomain  of  KAP-1  comprise  the  surfaces 
that  mediate  gene  silencing  via  interaction  with  the 
downstream  targets  including  the  heterochromatin 
proteins,  NURD  histone  deacetylase  complex, 
SETDB1  histone  methyl  transferase,  and  other 
protein  modification  complexes  (A.  Ivanov,  unpub- 
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lished  data)."2-3*’  According  to  this  model,  we 
postulate  that  KAP-1  functions  as  a  molecular 
scaffold  that,  in  turn,  coordinates  the  activities  of 
large  macromolecular  complexes  that  modify  chro¬ 
matin  structure  to  silence  gene  expression. 

Materials  and  Methods 

Preparation  of  plasmids 

Single  expression 

The  KRAB  domain  of  KRAB-ZFPs  were  expressed  using 
(1)  IMPACT  I,  (2)  QIAexpress,  (3)  GST  Gene  fusion,  and  (4) 
Baculovirus  Express  systems,  respectively  (Figure  1;  Table 
1).  The  pQE30  KOX1-KRAB  (1-161),  and  pQE30  KOX1- 
KRAB  (1-90)  plasmids  were  described.11'12  The  pQE30 
KOX1-KRAB  (12-76),  pQE30  SZF1-KRAB  (3-110),  pQE30 
ZNF133-KRAB  (1-119),  pQE12  ZNF133-KRAB  (1-119), 
pQE30  ZNF140-KRAB  (1-119),  pQE30  ZNF141-KRAB 
(1-107),  pQE30  EEK1-KRAB  (1-122),  pQE30  KRK1-KRAB 
(16-127),  and  pQE30  AJ18-KRAB  (4-91)  plasmids  were 
constructed  via  PCR  using  KRAB-ZNF  cDNA  as  a 
template  and  primer  pairs  (data  upon  request).  The  PCR 
product  was  digested  and  cloned  into  the  corresponding 
sites  of  the  pQE  vector  (Qiagen).  The  GST-KOX1-KRAB 
(1-90),  GST-KOX1-KRAB  (DV),  GST-SZF1-KRAB  (3-110) 
plasmids  were  described11'4  The  GST-KOX1-KRAB 
(2-59),  GST-AJ18-KRAB-L  (4-216),  GST-AJ 1 8-KRAB  (4- 
91),  GST-AJ  1 8-KRAB  (4-58),  GST-MTZ1-KRAB  (2-81), 
GST-MTZ1-KRAB  (2-50),  GST-MZF13-KRAB  (2-70), 
GST-MTZ13-KRAB  (2M9),  GST-ZNF202-SCAN-KRAB 
(37-286),  GST-ZNF274-KRAB-SCAN  (4-246),  and  GST- 
ZNF274-KRAB  (4-77)  plasmids  were  generated  via  PCR 
using  KRAB-ZNF  cDNA  as  a  template  and  primer  pairs 
(data  upon  request).  The  PCR  product  was  digested  and 
cloned  into  the  corresponding  sites  of  the  pGEX  vector 
(GE  Healthcare). 

The  RBCC  domain  of  KAP-1  was  expressed  using  (1) 
QIAexpress,  and  (2)  Baculovirus  Express  systems,  respec¬ 
tively.  The  pQE30  KAP-l-RBCC  (22—418)  plasmid  was 
described.12  The  pQE30  KAP-l-RBCC  (50M18),  pQE30 
KAP-1-B1B2CC  (136-418),  and  pQE30  KAP-1-B2CC 
(199—418)  plasmids  were  constructed  by  amplifying  the 
DNA  fragments  encoding  desired  fragment  of  the  KAP-1 
gene  using  the  primer  pairs  (data  upon  request),  and  the 
PCR  product  was  digested  and  cloned  into  the  corre¬ 
sponding  sites  of  the  pQE30  vector. 

Co-expression 

The  KRAB  domain  and  the  RBCC  domain  complexes 
were  expressed  using  a  two-plasmid  express  system  in  (1) 
bacteria,  and  (2)  baculovirus,  respectively.  The  pRMl 
KOX1-KRAB  (1-90),  pRMl  KOX1-KRAB  (12-76),  pRMl 
KOX1-KRAB  (12-90),  pRMl  KOX1-KRAB  (12-65),  and 
pRMl  AJ18-KRAB  (4—91)  plasmids  were  produced  via 
PCR  using  KRAB-ZNF  cDNA  as  a  template  and  primer 
pairs  (data  upon  request).  The  PCR  product  was  digested 
and  cloned  into  the  pRMl  vector.  The  pQE30  KAP-l- 
RBCC  plasmid  was  generated  as  described  above. 

Protein  expression  and  purification 

For  single  expression,  the  plasmids  of  expressing  His- 
tagged  proteins  were  transformed  into  E.  coti  SG13009 


cells  (Qiagen),  and  the  plasmids  of  expressing  GST-fusion 
proteins  were  transformed  into  BL21  (DE3)  cells  (Invitro- 
gen).  For  co-expression,  the  plasmids  of  expressing  non- 
tagged  KRAB  domain  were  first  transformed  into  BL21 
(DE3)  cells,  which  were  then  chemically  treated  to  become 
competent  to  allow  the  transformation  of  the  second 
plasmid,  pQE30  KAP-l-RBCC. 

The  SG13009  and  BL21  (DE3)  cells  bearing  the  desired 
plasmid  were  propagated  with  aeration  at  25  °C  in  either  2 
X  YT  medium  or  minimal  media,  containing  15NH4C1  as 
the  sole  nitrogen  source  (for  uniformly  labeled  15N  pro¬ 
tein)  to  an  A60o  of  approximately  0.8.  IPTG  was  added  to 
1  mM,  and  growth  at  15  °C  was  continued  (~  18  h).  The 
cells  were  harvested  by  centrifugation. 

The  His-tagged  protein  or  co-expressed  protein  com¬ 
plex  was  purified  at  4  °C  under  native  conditions  as 
recommended  by  the  manufacturer  (Qiagen).12  The  bac¬ 
terial  pellet  was  resuspended  in  the  buffer  A  (20  mM 
Hepes  (pH  7.5),  500  mM  NaCl,  10%  (v/v)  glycerol) 
with  5  mM  imidazole,  and  incubated  with  lysozyme 
(100  |ig/ml)  for  30  min  in  ice.  Then  1  mM  PMSF  was 
added  into  the  cell  lysate,  followed  by  sonication  and 
centrifugation.  The  supernatant  fraction  containing 
soluble  protein  was  incubated  with  Ni-NTA  resin  for 
1  h.  The  resin  was  washed  with  buffer  A  with  20  mM 
imidazole  and  1  mM  PMSF,  loaded  into  a  column,  and 
washed  with  buffer  A  in  the  absence  of  glycerol.  The 
protein  or  protein  complex  was  eluted  with  buffer  A 
with  300  mM  imidazole,  followed  by  gel  filtration  and 
concentration. 

GST-fusion  proteins  were  purified  at  4  °C  following 
the  manufacturere's  instruction.  The  bacterial  pellet  was 
resuspended  in  buffer  A  with  10  mM  DTT,  and 
incubated  with  lysozyme  (see  above).  Then  1  mM 
PMSF  was  added  into  the  cell  lysate,  followed  by 
sonication  and  centrifugation.  The  supernatant  fraction 
containing  soluble  protein  was  incubated  with  GST  resin 
for  2  h.  The  resin  was  washed  with  buffer  A  without 
PMSF,  followed  by  thrombin  digestion  (Amersham)  in 
buffer  B  (20  mM  Hepes  (pH  7.5),  100  mM  NaCl,  10  mM 
DTT)  at  4  °C  (~20  h).  The  protein  was  released  from 
GST  in  buffer  B  with  1  mM  PMSF,  and  then  purified 
using  Q-Sepherose  column  in  buffer  B  with  NaCl  from 
100  mM  to  1  M.  The  protein  fractions  were  pooled  and 
concentrated,  subjected  to  gel  filtration  and  concentrated 
to  desired  concentration. 

Gel-filtration  analysis 

Gel  filtration  of  highly  purified  proteins  was  performed 
using  a  Superdex-200  or  -75  column  (Amersham)  equili¬ 
brated  in  the  appropriate  buffers  with  1  mM  TCEP,  using 
an  FPLC  system  as  described.12  The  column  was  run  at 
4  °C  and  fractions  were  collected.  Aliquots  of  each  fraction 
were  analyzed  by  SDS-PAGE. 

Dynamic  light  scattering 

All  measurements  were  carried  out  on  a  DynaPro-801 
molecular  sizing  instrument  at  4  °C.  Protein  samples 
(5  mg/ml)  were  run  and  pre-filtered  through  a  0.02  |rm 
membrane  filter  (Whatman,  Anodise  13)  prior  to  analysis. 

Circular  dichroism  spectroscopy 

Spectra  were  collected  on  a  Jasco  810  spectropolari- 
meter  at  25°  using  a  1  mm  path-length  cuvette  (Hellma). 
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Spectra  were  collected  over  the  wavelength  range 
200  nm-250  ran  using  a  resolution  of  0.5  ran  and 
bandwidth  of  1  ran  at  room  temperature.  Each  scan  was 
taken  at  10  ran  min-1  with  a  response  time  of  8  s. 
Spectra  were  baseline  corrected  for  buffer  contributions 
and  mean  residue  ellipticity  was  calculated  using  the 
program  JFITJ.  Concentrations  of  AJ18-KRAB(Ab),  274- 
KRAB-SCAN,  and  KAP-l-RBCCHIS  were  0.1  mg/ml, 
0.2  mg/ml,  and  0.6  mg/ml,  respectively. 

Analytical  ultracentrifugation 

Sedimentation  equilibrium  experiments  were  per¬ 
formed  as  described  previously.12  The  experiments  were 
performed  at  4  °C  using  a  minimum  of  two  speeds.  The 
fringe  displacement  data  was  collected  every  4-6  hours 
until  equilibrium  was  reached,  as  determined  by  compar¬ 
ing  successive  scans  using  the  MATCH  v.7  program1. 
Data  was  edited  using  REEDIT  v.9  program1  and 
analyzed  using  the  NONLIN  v.3  program1.46  The 
program  SEDNTERP  was  used  to  calculate  the  molecular 
weight  and  the  partial  specific  volume  of  the  proteins 
based  on  the  amino  acid  sequence.  This  program  was 
also  used  to  calculate  the  solvent  density.  Three  data  sets 
from  different  initial  loading  concentrations  were  fitted 
simultaneously.  Examination  of  the  residuals  and  mini¬ 
mization  of  the  variance  determined  the  goodness  of 
fit. 

1  MATCH,  REEDIT,  and  NONLIN  are  available  from  the 
Analytical  Ultracentrifugation  Facility  at  the  University  of 
Connecticut  via  the  FTP  site.  The  program  SEDNTERP 
was  written  by  T.  Laue,  J.  Hayes,  and  J.  Philo  and  is 
available  on  the  RASMB  web  site. 

GST  association  assays 

The  GST  association  assays  were  performed  as 
described.11 

Competition  assays 

A  total  of  5  pg  of  freshly  prepared  GST-KOX1-KRAB 
(AB)  fusion  protein  immobilized  on  GST  resin  was  mixed 
with  1  pg,  5  pg,  and  10  pg  of  KOXl-KRAB(AB),  AJ18- 
KRAB(Ab),  and  ZNF274-KRAB(AB)-SCAN  proteins  in 
200  pi  of  BB500  buffer  (20  mM  Tris  (pH  7.9),  500  mM 
NaCl,  0.2  mM  EDTA,  10%  glycerol,  0.2%  (v/v)  Nonidet 
P-40,  1  mM  PMSF)  with  1  mg  of  bovine  serum  albumin. 
The  5  pg  of  KAP-l-RBCCHIS  protein  were  then  added 
into  reaction  and  incubated  for  1  h  at  room  temperature. 
The  protein  complexes  were  washed  four  times  with 
BB750  (same  as  BB500  with  750  mM  NaCl),  and  the 
bound  proteins  were  eluted  in  5  X  Laemmli  buffer, 
resolved  by  SDS-PAGE,  and  visualized  with  Coomassie 
blue  stain. 

ICP  spectrometry 

ICP  spectrometry  was  conducted  at  Galbraith  Labora¬ 
tories,  Inc.  (Knoxville,  TN).  Two  hundred  pi  of  a  200  pM 
AJ18-KRAB(Ab)HIS  sample  (as  negative  control  for 
non-specific  HIS  tag  binding  to  zinc),  150  pi  of 


t  ftp://jgiqc.llnl.gov/ cdfit/ 


330  pM,  and  150  pi  of  33  pM  AJ18-KRAB(Ab)HIS: 
KAP-l-RBCCHIS  complex  (PBS,  500  mM  NaCl,  1  mM 
TCEP  (pH  7.0))  were  prepared  for  ICP  analysis  using  a 
wet  ash  digestion  procedure.  Measurements  were  made 
on  a  Perkin-Elmer  P2000  using  a  primary  wavelength  of 
213.856. 


Limited  proteolysis,  mass  spectrometry,  and 
N-terminal  sequence  analyses 

The  protein  complex  was  treated  with  trypsin 
protease  (sequencing  grade  modified  trypsin;  Promega) 
in  buffer  C  (20  mM  Hepes  (pH  7.5),  100  mM  NaCl, 
1  mM  p-mecaptoethanol)  in  ice  for  60  min.  The  con¬ 
centration  of  trypsin  in  the  reaction  was  0.04  pg/ pi  and 
0.008  pg/ pi,  respectively.  The  reaction  was  terminated 
with  5  mM  DFP  (trypsin  protease  inhibitor)  in  ice  for 
48  h.  Aliquots  of  each  reaction  were  analyzed  by  SDS- 
PAGE,  mass  spectrometry,  and  N-terminal  sequence 
analyses. 

Reconstitution  of  the  AJ18-KRAB(Ab):KAP-1-RBCC 
complexes  in  vitro 

To  reconstitute  the  complex,  the  AJ18-KRAB(Ab)  and 
the  KAP-l-RBCCHIS  proteins  were  purified  first  under 
native  conditions  as  described  above.  Both  proteins  were 
then  mixed  at  4  °C  for  1  h.  The  protein  complex  was 
analyzed  by  gel  filtration  and  SDS-PAGE. 


NMR  spectroscopy 

NMR  spectra  were  acquired  at  25  °C  on  a  Varian  600 
MHz  spectrometer.  ID  XH  and  2D  NOESY  spectra 
were  acquired  using  the  Watergate  sequence  for  solvent 
suppresion.  Two-dimensional  1H-15N  HSQC  spectra 
were  typically  acquired  under  the  following  conditions: 
protein  or  protein  complex  concentrations  were  1  mM 
in  buffer  PBS,  500  mM  NaCl,  1  mM  TCEP  and  10% 
(v/v)  2H2O/90%H2O.  Spectra  were  acquired  with 
either  32  scans  or  128  scans  and  processed  using 
NMRPipe.60 
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can  be  found,  in  the  online  version,  at  doi:10.1016/ 
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Abstract 


Snail  family  transcriptional  repressors  play  an  essential  role  in  epithelial  mesenchymal 
transitions  (EMT)  during  both  physiological  and  pathological  processes.  A  conserved  SNAG 
repression  domain  in  vertebrate  Snail  family  proteins  is  necessary  for  assembly  of  a  repressor 
complex,  but  the  mechanism  by  which  this  complex  assembles  is  unknown.  Here,  we  identify 
the  Ajuba  family  of  LIM  proteins  as  corepressors  of  the  Snail  family  through  interaction  with  the 
SNAG  domain.  Ajuba  LIM  proteins  interact  with  Snail  proteins  in  the  nucleus,  contribute  to 
Snail-dependent  repression  of  E-cadherin,  and  are  present  on  endogenous  E-cadherin  promoters 
in  the  presence  of  Snail.  Using  Xenopus  neural  crest  as  a  model  of  in  vivo  Snail-  or  Slug- 
induced  EMT,  we  demonstrate  that  Ajuba  LIM  proteins  contribute  to  neural  crest  development 
and  that  blocking  interaction  of  Snail  proteins  with  Ajuba  LIM  proteins  causes  a  loss  or 
reduction  in  neural  crest  development.  Together,  these  data  indicate  that  Ajuba  LIM  proteins  are 
important  Snail  corepressors  required  for  in  vivo  Snail  function.  Because  Ajuba  LIM  proteins 
contribute  to  junction  assembly  or  stability  as  components  of  adherens  junctions  in  epithelia,  the 
functional  interaction  between  Ajuba  LIM  proteins  and  Snail  proteins  in  the  nucleus  suggests 
that  Ajuba  LIM  proteins  are  important  regulators  of  epithelia  dynamics. 
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Introduction 


Developmental  processes  such  as  gastrulation  and  neural  crest  cell  delamination  require 
epithelial  cells  to  undergo  a  mesenchymal  transition  in  order  to  invade  and  migrate  (Thiery  and 
Sleeman  2006).  This  process  of  an  epithelial  to  mesenchymal  transition  (EMT)  occurs  not  only 
during  development,  but  also  during  wound  repair  (Savagner  et  al.  2005),  chronic  inflammation 
and  fibrosis  (Kalluri  and  Neilson  2003),  and  tumor  progression  from  localized  epithelial 
adenomas  to  metastatic  carcinomas  (Thiery  2002).  While  many  environmental  signals  can  induce 
EMT,  they  all  converge  to  activate  transcription  factors  that  effect  an  EMT  “gene  program.” 

This  program  requires  transcriptional  down-regulation  of  epithelial  genes,  particularly  cell-cell 
adhesive  receptors  such  as  E-cadherin,  and  up-regulation  of  mesenchymal  genes  such  as 
fibronectin.  Together,  these  changes  allow  epithelial  cells  to  lose  adherence  to  neighboring  cells 
and  begin  to  migrate  and  invade.  A  fundamental  challenge  is  to  understand  whether  and  how 
cell  surface  adhesive  events  communicate  with  nuclear  processes  to  coordinate  this  dynamic 
transition. 

Many  transcription  factors  contribute  to  EMT  through  direct  repression  of  epithelial 
genes  such  as  the  adhesive  receptors  E-cadherin,  claudins,  and  occludins.  These  include 
dEFl/ZEBl  (Guaita  et  al.  2002),  SIP1/ZEB2  (Comijn  et  al.  2001),  E47  (Perez-Moreno  et  al. 
2001),  Twist  (Yang  et  al.  2004),  Snail  (Batlle  et  al.  2000;  Cano  et  al.  2000;  Ikenouchi  et  al.  2003) 
and  Slug  (Hajra  et  al.  2002).  Snail  and  Slug  belong  to  the  Snail  family  of  zinc-finger 
transcriptional  repressors  and  have  been  well-established  as  central  regulators  of  EMT  in 
gastrulation  (Carver  et  al.  2001;  Hemavathy  et  al.  2004),  neural  crest  induction  and  delamination 
(Nieto  et  al.  1994;  LaBonne  and  Bronner-Fraser  2000;  del  Barrio  and  Nieto  2002;  Aybar  et  al. 
2003),  skin  wound  repair  (Savagner  et  al.  2005),  mesothelial  fibrosis  (Barrallo-Gimeno  and 
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Nieto  2005),  and  tumor  metastasis  (Blanco  et  al.  2002;  Barrallo-Gimeno  and  Nieto  2005;  Moody 
et  al.  2005).  Specifically,  the  repressor  function  of  Snail  is  necessary  for  Drosophila  gastrulation 
(Carver  et  al.  2001;  Hemavathy  et  al.  2004)  and  repression  by  both  Snail  and  Slug  is  necessary 
for  Xenopus  neural  crest  induction  and  delamination  (LaBonne  and  Bronner-Fraser  2000;  Aybar 
et  al.  2003).  On  a  cellular  level,  repressor  activities  of  Snail  and  Slug  influence  cell  proliferation 
and  survival  (Kajita  et  al.  2004;  Vega  et  al.  2004),  cell  adhesion,  and  cell  migration  (reviewed  in 
(Barrallo-Gimeno  and  Nieto  2005).  Snail  and  Slug  affect  these  cellular  processes  both  in  the 
presence  and  absence  of  a  concomitant  EMT  (Barrallo-Gimeno  and  Nieto  2005). 

While  Snail  and  Slug  play  integral  roles  in  development  and  disease  through  gene 
repression,  the  precise  mechanism  of  Snail-dependent  repression  remains  unclear.  It  has  been 
shown  that  repression  by  both  Snail  and  Slug  is  sensitive  to  TSA,  a  histone  deacetylase  (HD AC) 
inhibitor  (Hemavathy  et  al.  2000;  Peinado  et  al.  2004).  Indeed,  Snail  recruits  an  HD  AC 
repressor  complex  (Peinado  et  al.  2004),  but  precisely  how  it  assembles  this  complex  at  select 
promoters  is  unknown.  In  Drosophila,  Snail  associates  with  a  corepressor,  CtBP,  which  is 
necessary  to  mediate  its  repressor  activity  (Nibu  et  al.  1998).  Vertebrate  Snail  family  members, 
however,  do  not  contain  a  conserved  CtBP  binding  domain.  Instead,  they  mediate  repression 
through  a  7-32  amino  acid  N-terminal  SNAG  (Snail-Gfi-1)  domain  that  is  necessary  and 
sufficient  for  repression  (Grimes  et  al.  1996;  Nakayama  et  al.  1998;  Hemavathy  et  al.  2000).  In 
addition,  the  SNAG  domain  is  necessary  for  recruitment  of  HD  AC  proteins  and  assembly  of  a 
repressor  complex  (Peinado  et  al.  2004).  Together,  these  data  strongly  support  the  presence  of 
cellular  co-repressor(s)  that  interact  with  the  SNAG  domain  of  vertebrate  Snail,  analogous  to 
CtBP  and  Drosophila  Snail. 
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In  a  screen  to  identify  SNAG  domain  interacting  co-repressors,  we  identified  the  Ajuba 
LIM  protein  family.  Ajuba  LIM  proteins  (Ajuba,  LIMD1,  WTIP)  are  closely  related  to  Zyxin 
LIM  proteins  (Zyxin,  LPP,  Trip6).  The  Ajuba/Zyxin  family  is  characterized  by  three 
homologous  C-terminal  LIM  domains  (LIM  region)  and  a  unique  N-terminal  region  (preLIM 
region).  These  proteins  localize  to  cell-cell  or  cell-matrix  adhesion  sites  in  epithelial  and 
fibroblast  cells,  respectively,  and  influence  cell  adhesive  complex  formation  and  function  (Marie 
et  al.  2003;  Srichai  et  al.  2004;  Pratt  et  al.  2005;  Hansen  and  Beckerle  2006;  Hoffman  et  al. 

2006).  In  addition,  they  shuttle  to  and  from  the  nucleus,  suggesting  they  have  the  potential  to 
coordinate  cell  surface  adhesive  events  with  nuclear  responses  (Nix  and  Beckerle  1997;  Kanungo 
et  al.  2000;  Sharp  et  al.  2004;  Srichai  et  al.  2004). 

We  show  that  Ajuba  LIM  proteins,  but  not  related  Zyxin  LIM  proteins,  specifically 
interact  with  Snail  proteins  in  the  nucleus  and  function  as  Snail  corepressors  to  downregulate  E- 
cadherin  transcription.  Ajuba  LIM  proteins  are  recruited  to  the  endogenous  E-cadherin  promoter 
in  a  Snail-dependent  manner.  In  vivo,  Ajuba  and  LIMD1  cooperate  with  Snail  and  Slug  during 
Xenopus  neural  crest  development  and  the  expression  pattern  of  endogenous  LIMD1  parallels 
that  of  Snail  and  Slug  during  Xenopus  development.  Thus,  in  addition  to  regulating  epithelial 
cell-cell  adhesion,  Ajuba  LIM  proteins  contribute  to  epithelial-mesenchymal  transitions  as  Snail 
co-repressors  during  neural  crest  development. 
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Results 


Ajuba  LIM  proteins  interact  with  the  Snail  family  of  proteins  in  the  nucleus 

The  SNAG  domain  of  vertebrate  Snail  proteins  is  both  necessary  and  sufficient  for 
transcriptional  repressor  activity  (Nakayama  et  al.  1998;  Hemavathy  et  al.  2000),  but  precisely 
how  the  SNAG  domain  mediates  assembly  of  a  repressor  complex  remains  unclear.  To  identify 
potential  Snail  family  co-repressors,  we  performed  a  yeast  two-hybrid  protein-protein  interactive 
screen  using  the  Gfi-1  SNAG  domain  fused  to  the  LexA  DNA  binding  domain  as  bait.  From  this 
screen,  two  of  the  three  Ajuba  family  LIM  proteins  (Ajuba  and  LIMD1)  were  identified  as 
potential  interactors.  Controls  in  yeast  cells  and  mammalian  cell  lines  demonstrated  that  both 
LIM  proteins  specifically  interacted  with  a  functional  SNAG  domain.  The  biochemical 
requirements  of  the  interaction  between  Ajuba  and  the  SNAG  domain  and  the  effect  on 
repression  as  modeled  by  a  synthetic  SNAG  domain-containing  repressor  have  been  analyzed 
and  submitted  as  a  separate  manuscript  (K.  Ayyanathan  et  al.  submitted). 

To  determine  whether  Ajuba  LIM  proteins  associate  with  full-length  Snail  proteins  in 
vertebrate  cells,  HEK293  cells  were  cotransfected  with  epitope-tagged  LIM  and  Snail  proteins. 
Snail  proteins  were  immunoprecipitated  from  total  cell  lysates  and  bound  products  Western 
blotted  for  the  presence  of  LIM  protein.  Ajuba,  LIMD1,  and  WTIP  were  all  found  to  interact 
with  all  Snail  family  members  tested  (Snaill,  Slug  (Snail2),  and  Scratch)  (Fig.  1A,  top  and  IB). 
However,  closely  related  LIM  proteins,  Zyxin  and  LPP,  did  not  interact  with  Snail  or  its  family 
members  (Fig.  1A  and  IB),  indicating  that  the  interaction  with  Snail  proteins  was  specific  to  the 
Ajuba  subfamily. 

Next,  we  determined  the  region  of  Ajuba  necessary  for  binding  to  Snail  by  co¬ 
transfecting  HEK293  cells  with  myc-tagged  full-length  Ajuba,  N-terminal  preLIM  region  of 
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Ajuba,  or  C-terminal  LIM  region  of  Ajuba  (Fig.  1C)  and  Flag-tagged  Snail.  Cells  were  lysed, 
Snail  immunoprecipitated,  and  bound  products  Western  blotted  for  the  presence  of  Ajuba 
isoforms.  The  C-terminal  LIM  region  of  Ajuba,  but  not  the  preLIM  region,  interacted  with  Snail 
(Fig.  1C).  Conversely,  to  define  whether  region(s)  of  Snail  other  than  the  SNAG  domain  also 
contribute  to  its  interaction  with  Ajuba,  we  cotransfected  HEK293  cells  with  HA-tagged  full- 
length  Snail  or  Snail  lacking  the  7  amino  acid  SNAG  domain  (ASNAG)  with  myc-tagged  Ajuba 
(Fig.  ID).  While  loss  of  the  SNAG  domain  decreased  the  interaction  by  80%,  Snail  ASNAG  still 
bound  to  Ajuba,  suggesting  that  other  regions  of  Snail  could  contribute  to  the  interaction  (Fig. 
ID).  Further  determination  of  the  Snail  region(s)  that  contribute  to  the  association  with  Ajuba 
was  complicated  by  instability  of  N-terminal  and  C-terminal  deletions,  as  has  been  previously 
observed  (Peinado  et  al.  2004).  Nonetheless,  this  analysis  demonstrated  that  the  major  domain  in 
Snail  that  mediated  its  interaction  with  Ajuba  was  the  SNAG  domain. 

To  confirm  that  the  interaction  of  Ajuba  and  Snail  occurred  in  cells  containing 
endogenous  levels  of  each  protein  we  used  two  epithelial  cell  lines  that  express  endogenous 
Snail:  HaCaT  human  keratinocytes  and  MDA-231  human  breast  cancer  cells.  Snail  is  a  labile 
protein  due  to  proteosomal  degradation  following  phosphorylation  by  GSK3|3  (Zhou  et  al.  2004). 
Therefore,  we  treated  cells  with  MG  132,  a  proteasome  inhibitor,  and  LiCl,  a  GSK-3  inhibitor,  for 
five  hours  to  stabilize  endogenous  Snail  (Zhou  et  al.  2004).  Lysates  were  immunoprecipitated 
with  a  Snail  antibody  and  bound  products  Western  blotted  for  the  presence  of  Ajuba  and  LIMD1. 
Ajuba  and  LIMD1  co-immunoprecipitated  with  Snail  in  both  cell  types  (Fig.  IE),  indicating  that 
the  interaction  occurred  in  cells  with  endogenous  levels  of  each  protein. 

After  confirming  that  Ajuba  and  Snail  interact,  we  sought  to  determine  whether  these 
proteins  co-localize  in  cells.  Both  Ajuba  and  Snail  shuttle  to  and  from  the  nucleus  (Kanungo  et 


7 


al.  2000;  Dominguez  et  al.  2003;  Zhou  et  al.  2004).  When  GFP-Snail  or  RFP-Ajuba  was 
expressed  alone  in  MCF-7  human  breast  epithelial  cells,  GFP-Snail  was  predominantly  nuclear 
(95%  of  cells)  while  RFP-Ajuba  was  predominantly  cytosolic  (85-90%  of  cells).  To  determine  if 
co-expression  of  Snail  affected  the  localization  of  Ajuba,  we  cotransfected  MCF-7  cells  with 
RFP-Ajuba  in  combination  with  GFP  or  GFP-Snail.  While  the  percent  of  cells  with  nuclear 
GFP-Snail  remained  the  same  in  the  presence  or  absence  of  Ajuba,  Snail  expression  caused  a 
greater  than  three-fold  increase  in  the  percent  of  cells  with  nuclear  RFP-Ajuba.  Co-transfection 
of  RFP-Ajuba  and  GFP  resulted  in  14%  of  cells  with  nuclear  Ajuba  (similar  to  RFP-Ajuba 
alone),  while  cotransfection  of  RFP-Ajuba  and  GFP-Snail  resulted  in  53%  of  cells  with  nuclear 
Ajuba  (Fig.  2A  and  2B).  Snail  8  Ser-Ala  (Sn8SA)  is  a  mutant  form  of  Snail  that  cannot  be 
phosphorylated  by  nuclear  GSK3|3  and,  thus,  is  stabilized  and  constitutively  localized  to  the 
nucleus  (Zhou  et  al.  2004).  When  GFP-Sn8SA  and  RFP-Ajuba  were  co-expressed,  an  even 
greater  percentage  of  cells  (76%)  contained  nuclear  Ajuba  (Fig.  2B).  Since  Sn8SA  does  not  exit 
the  nucleus,  this  result  suggested  that  Snail  was  capable  of  trapping  Ajuba  in  the  nucleus. 

In  another  approach  to  assess  the  interaction  of  Ajuba  and  Snail  in  the  nucleus,  HaCaT 
cells  stably  expressing  myc-Ajuba  were  fractionated  into  nuclear  and  cytosolic  extracts.  Snail 
was  immunoprecipitated  from  both  fractions,  and  bound  products  Western  blotted  for  the 
presence  of  Ajuba.  Snail  was  only  detected  in  the  nuclear  extract  and,  therein,  Snail  associated 
with  Ajuba  (Fig.  2C).  This,  in  combination  with  the  immunofluorescence  data,  indicated  that 
Ajuba  accumulates  in  the  nucleus  upon  expression  of  Snail  and  that  it  interacts  with  Snail  in  the 
nucleus. 
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Ajuba  acts  as  a  corepressor  to  Snail 

A  central  function  of  Snail  during  EMT  is  transcriptional  repression  of  epithelial  cell-cell 
adhesive  receptors,  such  as  E-cadherin  (Batlle  et  al.  2000;  Cano  et  al.  2000).  Since  Ajuba  LIM 
proteins  interact  with  Snail  in  the  nucleus,  we  sought  to  determine  the  functional  relevance  of 
this  interaction  by  testing  whether  co-expression  of  Ajuba  affects  the  ability  of  Snail  to  repress 
transcription  downstream  of  the  E-cadherin  promoter.  Transient  luciferase  reporter  assays  were 
performed  using  a  construct  in  which  luciferase  expression  is  driven  by  a  region  of  the  human  E- 
cadherin  promoter  that  contains  all  three  Snail-binding  E-boxes.  MCF-7  human  breast  cancer 
cells  were  transfected  with  the  reporter  construct  along  with  Snail  and  Ajuba  alone  or  in 
combination.  Both  Snail  and  Ajuba  alone  were  able  to  repress  transcription  downstream  of  the 
E-cadherin  promoter  (Snail  had  a  greater  effect  than  Ajuba),  but  when  co-expressed,  there  was 
increased  repression  over  either  alone,  suggesting  that  Ajuba  cooperates  with  Snail  to  repress  E- 
cadherin  transcription  (Fig.  3A).  The  ability  of  Ajuba  alone  to  repress  transcription  could  be  due 
to  the  trace  levels  of  endogenous  Snail  in  MCF-7  cells  (data  not  shown).  Alternatively,  in 
addition  to  an  interaction  with  Snail,  Ajuba  may  contribute  to  repression  through  a  Snail- 
independent  manner  as  has  been  observed  for  the  interaction  between  other  Ajuba  LIM  proteins 
and  other  nuclear  transcriptional  regulators  (Sharp  et  al.  2004). 

Accumulated  data  on  Ajuba  function  has  revealed  that  the  LIM  and  preLIM  regions 
interact  with  distinct  targets  within  common  signaling  pathways  or  cellular  functions  (Marie  et 
al.  2003;  Pratt  et  al.  2005).  Since  Ajuba  acts  as  a  Snail  corepressor  and  the  LIM  region  directs  its 
interaction  with  Snail,  this  raises  the  possibility  that  the  preLIM  region  is  available  to  coordinate 
assembly  of  a  chromatin  repressor  complex  on  promoters  containing  bound  Snail.  If  this  is  true, 
then  overexpression  of  the  LIM  region  alone  may  act  as  a  dominant  inhibitor  to  block  the 
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assembly  of  a  repressor  complex  at  a  Snail  regulated  promoter  by  interfering  with  the  interaction 
between  endogenous  full-length  LIM  protein  and  Snail.  Indeed,  when  the  Ajuba  LIM  region  was 
overexpressed,  it  resulted  in  de-repression  of  transcription  downstream  of  the  E-cadherin 
promoter,  even  in  presence  of  exogenous  Snail  (Fig.  3A).  This  indicated  that  the  LIM  region 
could  act  in  a  dominant  negative  manner  to  inhibit  the  co-repressor  function  of  full-length  Ajuba 
LIM  proteins  and,  thus,  block  Snail-dependent  repression. 

The  biochemical  basis  of  the  LIM  region’s  potential  “dominant  inhibitory”  function 
could  be  due  to  the  LIM  region  displacing  the  endogenous  full  length  Ajuba  LIM  protein  bound 
to  Snail  or  preventing  full-length  Ajuba  LIM  proteins  from  interacting  with  Snail.  To  test 
whether  the  Ajuba  LIM  region  interferes  with  the  Snail- Ajuba  interaction,  HEK293  cells  were 
cotransfected  with  fixed  and  equal  amounts  of  Flag- Ajuba  and  HA-Snail,  and  increasing 
amounts  of  myc- Ajuba  LIM  region.  Snail  was  immunoprecipitated,  and  bound  products  Western 
blotted  for  the  presence  of  Ajuba  to  determine  if  the  interaction  between  full-length  Ajuba  and 
Snail  decreased  in  the  presence  of  excess  Ajuba  LIM  region.  Indeed  as  the  amount  of  Ajuba 
LIM  region  transfected  increased,  the  interaction  of  Snail  with  full-length  Ajuba  decreased  (Fig. 
3B).  Together,  these  results  suggested  that  interaction  with  full-length  LIM  proteins  contributes 
to  the  ability  of  Snail  to  efficiently  repress  transcription. 

For  Ajuba  to  function  as  a  Snail  co-repressor,  it  should  be  present  at  the  promoters  of 
Snail  repressed  genes,  such  as  E-cadherin,  in  a  Snail-dependent  manner.  To  determine  if  this 
was  true,  Ajuba  chromatin  immunoprecipitations  (ChIP)  were  performed  in  the  presence  or 
absence  of  Snail,  using  the  endogenous  E-cadherin  promoter  as  the  target  sequence.  As  current 
antiserum  to  Snail  does  not  immunoprecipitate  endogenous  levels  of  Snail  present  in  chromatin 
complexes,  we  were  required  to  express  Flag-tagged  Snail  so  as  to  immunoprecipitate  Snail  with 
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anti-Flag  antibodies.  HEK293  cells  stably  expressing  Flag-Snail  or  control  empty  vector  were 
generated.  Expression  of  Snail  in  these  cells  resulted  in  decreased  E-cadherin  levels,  indicating 
that  Snail  is  active  (Fig.  3C).  Nuclear  chromatin  preparations  from  these  cells  were 
immunoprecipitated  with  preimmune  serum  (control)  or  antibodies  to  Flag,  Snail,  or  Ajuba  and 
PCR  performed  for  the  E-cadherin  promoter  using  primers  that  flanked  the  three  E-boxes  Snail 
binds.  Snail  was  present  on  the  E-cadherin  promoter  in  cells  overexpressing  Flag-Snail  as  shown 
by  either  Flag  or  Snail  ChIP  (Fig.  3D,  top).  Importantly,  endogenous  Ajuba  was  detected  on  the 
E-cadherin  promoter,  but  only  in  the  presence  of  Snail  (Fig.  3D,  bottom).  This  result 
demonstrated  that  Ajuba  was  present  on  the  promoter  of  a  gene  physiologically  repressed  by 
Snail  and  that  its  presence  on  the  promoter  was  dependent  upon  the  presence  of  Snail.  In  sum, 
these  data  indicated  that  Ajuba  LIM  proteins  are  functional  Snail  co-repressors. 

Overexpression  of  Ajuba  and  LIMD1  affects  neural  crest  development  in  Xenopus 

The  roles  of  Snail  and  Slug  have  been  well-characterized  in  Xenopus  neural  crest 
development,  a  well-defined  developmental  EMT  process.  Overexpression  of  Snail  or  Slug  early 
in  Xenopus  development  leads  to  increased  neural  crest  induction  and  enhanced  migration  of 
neural  crest  precursors  during  development  (LaBonne  and  Bronner-Fraser  1998;  Carl  et  al.  1999; 
LaBonne  and  Bronner-Fraser  2000;  Aybar  et  al.  2003).  Importantly,  neural  crest  development  is 
dependent  on  the  repressor  function  of  Snail  and  Slug  (FaBonne  and  Bronner-Fraser  2000; 

Aybar  et  al.  2003).  This,  coupled  with  the  ease  of  manipulation  of  Xenopus  embryos,  led  us  to 
utilize  Xenopus  neural  crest  development  to  determine  whether  the  interaction  between  Ajuba 
FIM  proteins  and  Snail  proteins  is  biologically  relevant  in  vivo. 
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We  first  tested  the  effect  of  overexpression  of  mouse  Ajuba  and  LIMD1  on  Xenopus 
neural  crest  development.  We  confirmed  that  mAjuba  and  mLIMDl  co-immunoprecipitated 
with  XSnail  (data  not  shown).  One  cell  of  the  two-cell  stage  X.  laevis  embryo  was  injected  with 
mAjuba  or  mLIMDl  mRNA.  This  introduces  the  mRNA  into  one  bilateral  half  of  the  embryo 
while  the  other  half  serves  as  an  internal  control.  Injection  of  XSnail  or  XSlug  mRNA  served  as 
a  positive  control  for  induction  of  neural  crest.  Embryos  were  co-injected  with  (3-galactosidase 
mRNA  to  track  the  injected  regions  through  development.  The  embryos  were  fixed  and  X-Gal 
stained  at  appropriate  stages  and  in  situ  hybridization  was  performed  for  markers  of  the  neural 
plate  (Sox2),  non-neuronal  ectoderm  (Epidermal  keratin,  Epikeratin)  and  neural  crest  (Slug, 
Twist,  and  FoxD3).  We  detected  a  marked  increase  in  the  area  of  neural  crest  on  the  side  of  the 
embryos  injected  with  each  mRNA  (Fig.  4A).  XSlug  in  situs  revealed  increased  staining  on  the 
injected  side  in  37.5%  (n=24)  and  65.9%  (n=85)  of  embryos  expressing  mAjuba  or  mLIMDl, 
respectively,  as  compared  with  78.6%  (n=56)  and  59.5%  (n=42)  of  embryos  that  were  injected 
with  XSlug  or  XSnail,  respectively  (Fig.  4A).  Similar  results  were  obtained  with  other  neural 
crest  markers  Twist  and  FoxD3  (Fig.  4A).  We  did  not  observe  a  significant  change  in  Sox2  or 
Epikeratin  staining  in  embryos  injected  with  mAjuba  or  mLIMDl  mRNA  (Fig.  4B)  at  stages  14 
and  12,  respectively,  indicating  that  initial  determination  of  neural  and  epidermal  fates  occurred 
normally.  Therefore,  mAjuba  and  mLIMDl  phenocopy  the  effects  of  XSnail  and  XSlug  to 
increase  neural  crest  area. 

The  LIM  region  blocks  neural  crest  development  in  vivo 

In  another  approach  to  test  the  role  of  Ajuba  LIM  proteins  in  neural  crest  development  in 
vivo ,  we  exploited  the  dominant  inhibitory  role  of  the  LIM  region  (see  Fig.  3)  to  block  all 
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endogenous  Xenopus  Ajuba  LIM  protein  from  interacting  with  XSnail  or  XSlug.  mRNA  of  the 
LIM  region  of  either  mAjuba  or  mLIMDl  was  injected  into  one  cell  of  the  two-cell  stage 
Xenopus  embryo.  Strikingly,  expression  of  either  LIM  region  reduced  or  eliminated  Slug  or 
Twist  staining  on  the  injected  half  of  the  embryo  (Fig.  5A).  Injection  of  mAjuba  LIM  region 
mRNA  reduced  Slug  staining  on  the  injected  half  in  52%  of  embryos  (n=25),  while  injection  of 
mLIMDl  LIM  region  mRNA  reduced  Slug  staining  in  56%  of  embryos  (n=75)  and  Twist 
staining  in  71.4%  of  embryos  (n=21)  (Fig.  5  A).  This  was  not  due  to  a  nonspecific  effect  (e.g. 
affecting  cell  survival  or  all  cell  fates),  as  the  side  of  the  embryos  injected  with  LIM  region 
mRNA  still  expressed  other  ectodermal  markers  Sox2  and  Epikeratin,  and  developed  into 
tadpoles  without  apparent  defects  at  a  gross  morphological  level  (Supplemental  Fig.  1). 

To  demonstrate  specificity  of  the  LIM  region  as  a  dominant  inhibitor  of  neural  crest 
development  as  regulated  by  endogenous  Ajuba  LIM  proteins,  increasing  amounts  of  full-length 
mLIMDl  mRNA  were  co-injected  into  one  cell  of  the  two-cell  Xenopus  embryo  in  combination 
with  a  fixed  amount  of  mLIMDl  LIM  region  mRNA.  Without  co-expression  of  full-length 
LIMD1,  mLIMDl  LIM  region  caused  a  decrease  in  neural  crest  on  the  injected  side  in  56%  of 
embryos  (n=75).  Co-injection  of  equal  amounts  of  full-length  mLIMDl  reduced  this  to  32%  of 
embryos  (n=46),  while  co-injection  of  twice  the  amount  of  full-length  mLIMDl  gave  only  5.5% 
embryos  (n=54)  with  decreased  neural  crest.  Instead,  65%  of  embryos  (n=54)  showed  an 
increase  in  neural  crest  development  on  the  injected  side,  consistent  with  mLIMDl 
overexpression  (Fig.  5B). 
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Ajuba  LIM  proteins  specifically  affect  Snail -induced  neural  crest  development  in  vivo 

We  have  shown  that  overexpression  of  mAjuba  LIM  proteins  enhanced  Xenopus  neural 
crest  development,  similar  to  overexpression  of  XSnail  or  XSlug.  Moreover,  preventing 
Xenopus  Ajuba  LIM  proteins  from  binding  to  their  LIM  region  targets  (by  expression  of  the  LIM 
region  alone)  inhibited  neural  crest  development.  The  ability  of  the  Ajuba  LIM  region  to  inhibit 
neural  crest  development  could  be  due  to  blocking  endogenous  LIM  protein-Snail  interactions  or 
to  blocking  the  interaction  of  the  endogenous  LIM  protein(s)  with  other  proteins  that  influence 
neural  crest  development.  To  test  directly  whether  the  LIM  region  blocked  Snail  regulated 
neural  crest  development  specifically,  we  co-injected  mLIMDl  LIM  region  mRNA  with  XSnail 
or  XSlug  mRNA.  We  found  that  overexpression  of  the  LIM  region  was  sufficient  to  block  the 
expansion  of  neural  crest  normally  seen  upon  overexpression  of  XSnail  and  XSlug.  In  fact,  co¬ 
injection  of  the  mLIMDl  LIM  region  with  XSlug  resulted  in  an  inhibition  of  neural  crest 
development  in  90.5%  (n=24)  and  60%  (n=15)  of  embryos  as  detected  by  Slug  and  Twist 
staining,  respectively  (Fig.  6).  Similarly,  co-injection  of  the  LIM  region  with  XSnail  resulted  in 
an  inhibition  of  neural  crest  development  in  91.7%  (n=21)  and  46.7%  (n=15)  of  embryos  as 
detected  by  Slug  and  Twist  staining,  respectively  (Fig.  6).  While  this  experiment  does  not 
entirely  eliminate  the  possibility  of  affecting  other  targets  of  the  LIM  domains,  these  results  did 
indicate  that  the  interaction  of  XSnail  or  XSlug  with  full-length  Xenopus  LIM  proteins  was 
necessary  for  XSnail-  or  XSlug-induced  neural  crest  development. 

Two  Xenopus  laevis  orthologs  in  the  Ajuba  subfamily 

Expression  of  mAjuba  and  mLIMDl  mRNA  in  Xenopus  embryos  resulted  in  increased 
neural  crest  development,  consistent  with  their  potential  roles  as  Snail  family  corepressors.  We 
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sought  to  identify  endogenous  Xenopus  LIM  protein(s)  in  order  to  determine  whether  this  is  a 
physiological  role  for  Ajuba  LIM  proteins  during  Xenopus  development.  An  EST  database 
search  identified  four  putative  orthologs,  suggesting  that  the  Xenopus  families  are  less  complex 
than  the  mammalian  families.  Two  proteins  were  in  the  Ajuba  subfamily,  and  had  closest 
homology  to  mammalian  LIMD1  and  WTIP  (Fig.  7).  We  cloned  and  characterized  Xenopus 
LIMD1  (XLIMD1,  GenBank  Accession  DQ913740).  Like  its  mammalian  orthologs,  XLIMD1 
has  three  C-terminal  LIM  domains  and  a  preLIM  region.  In  the  LIM  region,  XLIMD1  has  83% 
identity  to  mLIMDl  and  66%  identity  to  mAjuba  at  the  amino  acid  level.  The  preLIM  region  of 
XLIMD1  shares  only  29%  identity  to  mLIMDl,  but  has  80%  identity  in  the  N-terminal  61  amino 
acids  (Supplemental  Fig.  2). 

We  next  determined  endogenous  XLIMD1  expression  during  Xenopus  development. 

Two  distinct  antisense  probes  to  XLIMD1  mRNA  were  used  for  in  situ  hybridization  and  gave 
the  same  result.  Sense  probes  served  as  controls  for  specificity  (Fig.  8A,  left).  Low  level,  non¬ 
specific,  or  ubiquitous  staining  of  XLIMD1  was  seen  in  blastula  and  gastrula  stage  embryos, 
however,  by  stage  13.5,  XLIMD1  was  enriched  throughout  the  neural  plate  (data  not  shown).  In 
stage  17-19  embryos,  XLIMD1  was  enriched  in  the  territory  encompassing  the  anterior  neural 
plate  and  the  premigratory  neural  crest  (Fig.  8A).  By  tailbud  stages  (Stage  21-24),  there  was 
strong  expression  of  XLIMD1  in  the  migrating  neural  crest,  as  well  as  in  the  eye,  brain,  and  otic 
placode  (Fig.  8A).  By  comparison,  Snail  and  Slug  are  expressed  in  the  prospective  neural  crest 
in  neurula  stages  and  in  migrating  neural  crest  during  and  after  stage  21  (Linker  et  al.  2000). 
Therefore  XLIMD1  expression  during  Xenopus  development  largely  overlaps  with  XSnail  and 
XSlug.  To  confirm  co-localization  in  developing  neural  crest,  double  in  situ  hybridization  was 
performed  for  XLIMD1  and  XSlug.  Expression  patterns  indeed  overlapped  in  regions  of  neural 
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crest  in  neurula  and  tailbud  stage  embryos  (Fig.  8B).  Therefore,  XLIMD1  colocalized  with 
XSnail  and  XSlug  in  areas  of  developing  neural  crest,  consistent  with  a  physiological  role  in  the 
development  of  this  tissue. 

Finally,  we  tested  whether  XLIMD1  had  the  same  effect  as  mammalian  orthologs  on 
neural  crest  development.  First,  we  confirmed  that  XLIMD1  co-immunoprecipitated  with 
XSnail  (data  not  shown).  We  next  injected  XLIMD1  mRNA  into  one  cell  of  the  two-cell 
embryo.  Injected  embryos  were  fixed  and  in  situ  hybridization  performed  for  neural  crest 
markers  Slug,  Twist  and  FoxD3.  Similar  to  mammalian  LIM  proteins,  ectopic  expression  of 
XLIMD1  resulted  in  42.5%  (n=40),  22.4%  (n=49),  and  66.7%  (n=21)  of  embryos  with  increased 
neural  crest  on  the  injected  side  as  detected  by  Slug,  Twist,  and  FoxD3  staining,  respectively 
(Fig.  9).  Therefore,  overexpression  of  both  murine  and  Xenopus  Ajuba  LIM  proteins,  like 
overexpression  of  XSnail  or  XSlug,  resulted  in  an  expansion  of  the  neural  crest  territory. 
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Discussion 


Here,  we  show  that  Ajuba  and  LIMD1  are  functional  corepressors  of  the  vertebrate  Snail 
family.  Based  on  these  results,  we  propose  a  model  in  which  Ajuba  LIM  proteins  bind  via  their 
LIM  regions  to  Snail  at  promoters  of  Snail-regulated  genes  (Fig.  10).  This  interaction  is 
mediated  predominantly  by  the  SNAG  domain  of  Snail  and  the  LIM  region  of  the  LIM  proteins, 
leaving  the  PreLIM  region  of  Ajuba  LIM  proteins  available  to  facilitate  assembly  of  the 
transcriptional  repressor  complex  at  these  sites. 

In  this  model,  the  LIM  region  alone  is  predicted  to  act  in  a  dominant  inhibitory  manner 
by  blocking  endogenous  full-length  LIM  proteins  from  binding  to  Snail,  thereby  reducing  Snail- 
dependent  repression.  In  support  of  this  model,  expression  of  excess  LIM  region  alone  prevents 
the  interaction  of  full-length  LIM  protein  with  Snail  (see  Fig.  3B)  and  gives  a  dominant 
inhibitory  phenotype,  both  in  transient  reporter  assays  (Fig.  3A)  and  in  vivo  during  Xenopus 
neural  crest  development  (Fig.  5A).  Furthermore,  the  effect  of  the  LIM  region  on  Xenopus 
neural  crest  development  can  be  rescued  by  addition  of  excess  full  length  LIMD1  (Fig.  5B), 
suggesting  specificity  of  the  action  of  the  LIM  region.  Together,  these  data  suggest  that  the  full- 
length  LIM  protein  is  necessary  for  Snail-dependent  functions.  It  remains  a  possibility  that  the 
LIM  region  has  additional  targets  in  the  nucleus,  and  we  cannot  rule  out  that  some  effects  are 
through  alternative  targets.  Nonetheless,  we  provide  strong  evidence  that  Snail  is  an  important 
nuclear  target  of  Ajuba  and  LIMD1,  and  that  Ajuba  LIM  proteins  affect  Snail  and  Slug  function 
in  the  nucleus.  Indeed,  the  LIM  region  of  LIMD1  blocked  neural  crest  development  even  in  the 
presence  of  exogenous  Snail  or  Slug  expression  (Fig.  6). 

All  members  of  the  Ajuba  subfamily  of  LIM  proteins  interact  with  Snail  family  members 
(Fig.  IB)  and  serve  redundant  functions,  as  observed  upon  expression  of  either  Ajuba  or  LIMD1 
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in  Xenopus  (Fig.  4A).  Therefore,  to  determine  a  loss  of  function  phenotype  for  these  proteins,  it 
is  likely  that  the  function  of  all  members  of  the  Ajuba  LIM  family  will  have  to  be  blocked  to 
observe  a  phenotype.  The  potential  for  compensation  between  Ajuba,  LIMD1,  and  WTIP  is 
demonstrated  by  the  fact  that  we  do  not  observe  gross  developmental  pathologies  in  the  single 
knockout  mice  of  Ajuba  or  LIMD1  (Marie  et  al.  2003;  Feng  and  Longmore,  unpublished  data). 
Here,  in  both  mammalian  cell  lines  and  Xenopus  embryos,  we  have  used  the  LIM  region  as  a 
dominant  inhibitory  protein  to  potentially  block  binding  of  all  Ajuba  LIM  family  members  to  the 
Snail  proteins.  The  use  of  the  LIM  region,  therefore,  would  prevent  compensation  that  would  be 
likely  if  only  one  LIM  subfamily  protein  were  lost.  Similarly,  others  found  that  the  two  LIM 
proteins  Zyxin  and  LPP  served  redundant  functions  in  the  recruitment  of  VASP  to  cadherin- 
based  junctions.  A  dominant  inhibitory  mutant  of  either  protein  that  failed  to  bind  VASP  could 
block  the  normal  function  of  both  Zyxin  and  LPP  in  stabilizing  nascent  epithelial  junctions 
(Hansen  and  Beckerle  2006).  It  will  be  important  to  further  explore  genetically  the  extent  of 
compensation  between  the  Ajuba  LIM  family  proteins  both  in  mice  and  in  lower  organisms. 

Because  the  LIM  region  alone  acts  in  a  dominant  inhibitory  manner,  we  hypothesize  that 
the  preLIM  region  is  important  for  the  repressor  activity  of  Ajuba,  perhaps  through  interactions 
with  other  repressor  complex  proteins.  Previously,  we  have  shown  that  the  preLIM  and  LIM 
regions  of  Ajuba  function  as  modular  domains  bringing  together  separate  proteins  to  regulate  a 
common  pathway  or  cellular  function.  For  example,  in  epithelial  cells  the  Ajuba  LIM  region 
directs  Ajuba  to  adherens  junctions  via  binding  to  a-catenin,  while  the  preLIM  region  interacts 
directly  with  F-Actin,  providing  a  link  between  the  junctions  and  the  cytoskeleton  (Marie  et  al. 
2003).  In  fibroblasts,  the  LIM  region  directs  Ajuba  to  focal  adhesions  while  the  preLIM  region 
interacts  with  p!30cas,  allowing  for  localized  activation  of  Rac  during  cell  migration  (Pratt  et  al. 
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2005).  It  will  be  critical  to  define  the  role  of  the  preLIM  region  in  Snail-mediated  repression  in 
future  experiments. 

While  Ajuba  and  its  family  members  do  not  directly  bind  DNA,  it  is  becoming 
increasingly  clear  that  through  protein-protein  interactions  with  known  transcription  factors 
many  LIM  domain  containing  proteins,  including  members  of  the  Ajuba  and  Zyxin  families,  can 
regulate  expression  of  genes  in  a  variety  of  cells.  For  example,  LIMD1  was  recently  identified 
as  a  corepressor  of  the  retinoblastoma  protein  (Sharp  et  al.  2004),  WTIP  inhibits  transcriptional 
activation  by  WT-1  (Srichai  et  al.  2004),  LPP  is  a  coactivator  of  PEA3,  an  ETS  domain 
transcription  factor  (Guo  et  al.  2006),  and  TRIP6  enhances  transactivation  by  v-Rel  (Zhao  et  al. 
1999).  As  we  have  observed  for  the  interaction  between  Ajuba  LIM  proteins  and  Snail  proteins, 
in  most  cases,  the  LIM  region  directs  the  interaction  with  the  DNA  binding  protein.  This  then 
leaves  the  preLIM  region  available  to  potentially  coordinate  the  assembly  of  repression  or 
activation  complexes  at  these  sites. 

Snail  family  members  contribute  to  many  biological  and  cellular  processes  including 
gastrulation  and  neural  crest  development  as  well  as  cell  proliferation,  survival,  adhesion,  and 
migration  (Barrallo-Gimeno  and  Nieto  2005).  Precise  mechanisms  of  regulation  of  these 
different  roles  in  various  cell  types  and  organisms  remain  unclear.  Using  Xenopus  neural  crest 
development  as  a  model  for  Snail  or  Slug  function,  we  show  that  Ajuba  LIM  proteins  cooperate 
with  Snail  and  Slug  in  vivo.  Overexpression  of  Ajuba  LIM  proteins  gives  increased  neural  crest, 
similar  to  overexpression  of  XSnail  or  XSlug  (Lig.  4A).  This  role  in  Xenopus  is  consistent  with 
Ajuba  and  LIMD1  acting  as  corepressors  of  Snail  and  Slug  as  the  repressor  activity  of  Snail  and 
Slug  has  been  shown  to  be  essential  for  their  roles  in  neural  crest  development  (LaBonne  and 
Bronner-Lraser  2000;  Aybar  et  al.  2003).  Importantly,  blocking  the  interaction  of  full-length 
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Ajuba  LIM  proteins  with  Snail  by  expressing  the  Ajuba  or  LIMD1  LIM  regions  alone  blocks 
neural  crest  development  (Fig.  6).  This  also  results  in  a  loss  of  at  least  one  neural  crest 
derivative,  melanocytes,  at  later  stages  (see  Supplemental  Fig.  1).  The  genes  directly  repressed 
by  Snail  and  Slug  in  Xenopus  are  unknown,  but  as  these  are  uncovered,  it  will  be  important  to 
determine  whether  Ajuba  LIM  proteins  affect  all,  or  a  subset,  of  these  genes.  In  addition,  it  will 
be  interesting  to  expand  these  studies  to  determine  the  extent  to  which  Ajuba  LIM  proteins  affect 
Snail  function  in  other  processes. 

Here,  we  describe  a  nuclear  role  for  Ajuba  LIM  proteins  as  corepressors  with  Snail  that 
contribute  to  repression  of  E-cadherin.  Previously,  Ajuba,  LIMD1,  and  WTIP  all  have  been 
shown  to  localize  to  cell-cell  adhesive  junctions  in  epithelial  cells  (Marie  et  al.  2003;  Srichai  et 
al.  2004).  Ajuba  contributes  to  adherens  junction  formation  and/or  stability,  in  part  by  coupling 
the  E-cadherin  adhesive  complex  to  the  actin  cytoskeleton  (Marie  et  al.  2003).  Because  of  these 
distinct  roles  for  the  separate  pools  of  Ajuba,  it  may  play  an  integral  role  in  communicating  from 
adhesive  complexes  to  the  nucleus  to  provide  precise  regulation  of  this  process.  While  we  do  not 
have  evidence  that  Ajuba  LIM  proteins  can  initiate  EMT,  we  do  show  that  expression  of  Snail 
can  result  in  the  trapping  of  Ajuba  in  the  nucleus  (Fig.  2A  and  2B).  Once  in  the  nucleus,  Ajuba 
LIM  proteins  can  act  as  co-repressors  to  Snail  family  members,  and  could  contribute  to  a  feed 
forward  loop  to  maintain  the  mesenchymal  phenotype.  By  contributing  to  repression  of  E- 
cadherin,  Ajuba  may  also  indirectly  allow  more  Ajuba  to  be  available  for  entry  into  the  nucleus 
(as  Ajuba  is  normally  present  in  E-cadherin  dependent  adherens  junctions).  Consistent  with  this, 
Jamora,  et  al.  found  that  Ajuba  is  released  from  adherens  junctions  upon  expression  of  Snail  and 
subsequent  transcriptional  downregulation  of  E-cadherin  expression  (Jamora  et  al.  2005).  We 
have  also  observed  the  release  of  Ajuba  from  epithelial  junctions  in  HaCaT  epithelial  cells  upon 
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treatment  with  TGF-p,  which  induces  EMT  (Langer  and  Longmore,  unpublished  data).  In  sum, 
Ajuba  LIM  proteins  could  play  an  important  role  in  the  regulation  of  EMT  by  providing 
communication  between  adherens  junctions  and  the  nucleus  during  this  dynamic  transition. 

It  is  also  of  interest  to  determine  whether  Ajuba  LIM  proteins  play  a  role  in  the  reverse 
process,  a  mesenchymal  to  epithelial  transition  (MET).  MET  occurs  physiologically  during 
mammalian  nephrogenesis  (Kanwar  et  al.  2004)  and  pathologically  upon  redifferentiation  of 
metastatic  cells  as  they  reform  epithelial-like  tumors  at  metastatic  sites  (Thiery  and  Sleeman 
2006).  Recently,  Snail  was  shown  to  be  capable  of  repressing  its  own  transcription  (Peiro  et  al. 
2006),  perhaps  providing  a  mechanism  to  initiate  MET.  As  Snail  expression  is  decreased,  Ajuba 
could  exit  the  nucleus,  be  recruited  to  newly  forming  adherens  junctions,  and  thereby  contribute 
to  epithelia  formation. 
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Materials  and  Methods 


Cell  Culture,  Transfection,  and  Antibodies 

HEK293T,  MCF-7,  HaCaT,  and  MDA-231  cells  were  grown  in  DMEM  supplemented 
with  10%  FBS,  2mM  L-Glutamine,  100  units/ml  penicillin  and  lOOug/ml  streptomycin.  Stable 
cell  lines  of  HaCaT  expressing  myc-Ajuba  were  selected  and  grown  in  G418  and  HEK293T  cell 
expressing  Snail  were  selected  and  grown  in  Zeocin.  All  transfections  were  performed  with 
Trans-IT  LT-1  (Mims)  according  to  the  manufacturer’s  protocol.  Antibodies  used  were:  Ajuba 
(Cell  Signaling,  and  (Pratt  et  al.  2005)),  Snail  (Santa  Cruz),  and  HA  (Sigma).  HRP  conjugated 
antibodies  to  Flag  and  Myc  tags  were  from  Sigma. 

Plasmids 

pCS2-myc-mAjuba,  pCS2-myc-mFIMDl,  pCS2-myc-FPP,  pCS2-myc-Zyxin,  pCS2- 
myc-mAjubaPreFIM,  pCS2-myc-mAjubaFIM,  and  pcDNA3.1-RFP-mAjuba  have  been 
described  previously  (Goyal  et  al.  1999;  Marie  et  al.  2003;  Pratt  et  al.  2005).  pCMV14-mSnail- 
Flag  was  a  kind  gift  from  H.  F.  Grimes  (University  of  Fouisville).  pCMV5-HA-mSnail  full- 
length  and  ASNAG  constructs  were  generated  through  PCR  into  the  pCMV5-HA  vector. 
pcDNA3.1-GFP-hSnailWT  and  pcDNA3.1-GFP-hSnail8SA  were  subcloned  into  pcDNA3.1 
(Invitrogen)  from  the  corresponding  CMV-Tag  constructs,  which  were  from  M.  Hung 
(University  of  Texas  M.D.  Anderson  Cancer  Center).  pCS2-XSlug  was  from  C.  FaBonne 
(Northwestern  University)  and  pCS2-myc-XSnail  was  constructed  by  PCR  from  an  X.  laevis 
cDNA  library.  All  subcloning  was  confirmed  by  DNA  sequencing. 

Immunoprecipitation,  Immunoblots,  and  Immunofluorescence 

Cells  were  lysed  in  IP  buffer  (lOmM  Tris  pH7.4,  150mM  NaCl,  ImM  EDTA  lOmM 
NaF,  10%  glycerol,  1%  NP-40,  protease  inhibitors)  and  extract  clarified  by  spinning  at 
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12,000rpm  for  20  minutes.  Lysates  were  pre-cleared  with  protein  A  or  G  beads  alone  for  1  hour, 
then  incubated  overnight  with  antibody.  Protein  A  or  G  beads  were  added  for  1  hour,  washed  4 
times  with  IP  buffer,  and  bound  proteins  eluted  directly  into  SDS-PAGE  sample  buffer.  For  Flag 
M2  conjugated  beads  (Sigma)  were  used  for  immunoprecipitation  and  bound  proteins  eluted  by 
competition  with  Flag  peptide.  SDS-PAGE  and  immunoblots  were  performed  using  standard 
protocols.  For  nuclear  and  cytosolic  extracts,  HaCaT  cells  were  lysed  in  a  hypotonic  buffer,  cell 
membranes  disrupted  by  dounce  homogenization,  and  nuclei  pelleted  by  centrifugation. 
Supernatant  was  used  as  cytosolic  extract.  Nuclei  were  then  extracted  in  hypertonic  lysis  buffer 
and  centrifuged  again.  Extracts  were  adjusted  to  isotonic,  and  immunoprecipitations  performed. 

Immunofluorescence  was  performed  as  previously  described  (Marie  et  al.  2003).  Images 
were  taken  on  a  Zeiss  confocal  microscope  using  the  LSM  550  software.  Localization  of  Ajuba 
was  quantified  by  counting  at  least  100  cells  co-transfected  with  GFP  and  RFP  constructs. 
Luciferase  Assay 

MCF-7  cells  were  transiently  transfected  with  an  0. 1 2ug  E-cadherin-luciferase  reporter 
construct  (Hajra  et  al.  2002),  0.12qg  CMV-|3-galactosidase  and  combinations  of  empty  vector, 
mAjuba,  mAjuba  LIM,  and  mSnail  to  equal  1 ,2ug  total  DNA.  Cells  were  lysed  in  Cell  Culture 
Lysis  Reagent  (Promega),  and  lysate  used  in  both  a  luminescent  (3-galactosidase  detection  assay 
(BD  Biosciences)  and  luciferase  assay.  All  samples  were  read  on  a  luminometer  and  values 
shown  were  obtained  by  normalizing  luciferase  values  to  (3-galactosidase  activity. 

Chromatin  Immunoprecipitation 

Cells  were  grown  in  150mm  plates  to  70-90%  confluency,  fixed  in  1%  formaldehyde, 
and  harvested,  and  chromatin  immunoprecipitations  performed  using  the  EZ-CHIP  kit  (Upstate) 
according  to  the  manufacturer’s  protocol.  PCR  was  performed  on  immunoprecipitated  DNA 
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using  the  following  primer  pairs:  5’  AATCAGAACCGTGCAGGTCC  3’  and  5’ 

AC AGGT GCTTT GC AGTT CCG  3’  (see  Figure  3D) 

X.  laevis  embryos  and  mRNA  injections 

X.  laevis  albino  embryos  were  obtained  through  in  vitro  fertilization  and  raised  as 
described  previously  (Kroll  et  al.  1998).  Embryos  were  staged  following  Nieuwkoop  and  Faber 
(Nieuwkoop  and  Faber,  1967).  For  injections,  capped  mRNA  was  transcribed  in  vitro  using  the 
mMessage  mMachine  kit  (Ambion).  RNAs  (250-500pg)  were  coinjected  into  one  blastomere  of 
two-cell  stage  embryos  with  (3-galactosidase  mRNA  (50pg). 

In  situ  hybridizations 

Embryos  were  raised  until  indicated  stages,  fixed  for  1  hour  in  MEMFA,  X-gal  (5- 
bromo-4-chloro-3-indolyl  (3-galactopyranoside)  stained  and  analyzed  by  in  situ  hybridizations  as 
described  (Harland  1991).  For  XFIMD1  in  situs,  two  IMAGE  clones  of  XFIMD1,  5513507 
(CF271778)  and  6640526  (BU914068),  that  overlap  the  preFIM  region  (nts  1-573)  and  FIM 
region  (nts  1330-1836)  were  used.  Sense  and  antisense  in  situ  probes  of  each  XFIMD1  clone 
and  antisense  probes  of  Sox2  (Mizuseki  et  al.  1998),  Epidermal  keratin  (Epikeratin)  (Jonas  et  al. 
1985),  Slug  (Mayor  et  al.  1995),  Twist  (Hopwood  et  al.  1989),  and  FoxD3  (Bellmeyer  et  al. 
2003)  were  generated  by  in  vitro  transcription  with  digoxigenin- 1 1-UTP  (Roche).  Probes  were 
detected  with  alkaline  phosphatase  (AP)-conjugated  anti-digoxigenin  antibodies  (Roche)  with 
Nitro  blue  tetrazolium/5-bromo-4-chloro-3-indolyl-phosphate  (NBT/BCIP;  Roche).  For  double 
in  situs,  the  Dig-labeled  XFIMD1  probe  (6640526)  was  used  in  combination  with  a  probe  to 
Slug  generated  by  in  vitro  transcription  with  fluorescein- 12-UTP  (Roche).  Probes  were  detected 
using  AP-anti-digoxenin  (Roche)  with  BCIP  (Roche)  and  AP-anti-fluorescein  (Roche)  with 
Magenta  phosphate  (5-bromo-6-chloro-3-indoxyl  phosphate,  p-toluidine  salt;  Biosynth). 


24 


Identification  ofX.  laevis  orthologs  and  cloning  ofX.  laevis  LIMD1 

An  EST  database  search  revealed  partial  sequences  overlapping  the  LIM  regions  of  Zyxin 
(BU907427,  BG234393,  BJ068010,  BI349606,  and  BG409548),  LPP  (BC077865,  BU903395, 
CB983325,  BG363738,  BU912724,  BM179756,  and  BG364767),  LIMD1  (BU914068  and 
AW6421 19),  and  WTIP  (BJ068924  and  AW645564).  Phylogenetic  relationships  between  the 
LIM  regions  of  M.  musculus,  D.  melanogaster,  C.  elegans,  and  putative  X.  laevis  LIM  proteins 
were  generated  using  the  Clustal  W  algorithm  and  MegAlign  software  (DNASTAR).  The  full 
LIM  region  was  used  for  all  proteins  except  X.  laevis  WTIP,  where  only  two  of  three  LIM 
domains  have  been  identified.  Gene-specific  primers  to  full  length  XLIMD1  were  designed 
based  on  available  EST  sequences  (CL271778  and  BU914068).  With  these,  the  full-length 
XLIMD1  cDNA  was  amplified  from  a  X.  laevis  stage  24  embryo  cDNA  library  by  PCR.  This 
fragment  was  cloned  into  pCS2  and  sequenced  (Gen  Bank  Accession  DQ9 13740). 
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Figure  Legends 

Figure  1.  Ajuba  LIM  proteins  interact  with  the  Snail  family  of  transcriptional  repressors. 

A.  Myc-tagged  LIM  proteins  and  Flag-tagged  Snail  were  cotransfected  into  HEK293  cells. 

Snail  was  immunoprecipitated  from  lysates  (anti-Flag)  and  bound  products  Western  blotted  for 
LIM  protein  (anti-myc)  and  Snail  (anti-Flag).  Control  Western  blot  of  lysate  is  on  right  panel  of 
each  set.  B.  Table  summarizing  the  interaction  between  Snail  family  members  and  Ajuba  or 
Zyxin  family  members,  as  determined  using  the  method  described  in  (A).  ND  -  not  determined. 
C.  Top:  Schematic  diagram  of  Ajuba  constructs  used.  NES  -  nuclear  export  sequence.  Bottom: 
Co-immunoprecipitation  experiments  as  described  for  (A).  D.  Top:  Schematic  diagram  of  full 
length  and  mutant  Snail  constructs  used.  Bottom:  Myc-tagged  Ajuba  and  HA-tagged  Snail 
constructs  were  cotransfected  into  HEK293  cells.  Snail  was  immunoprecipitated  (anti-HA)  and 
bound  products  Western  blotted  for  Ajuba  (anti-myc)  and  Snail  (anti-HA).  Control  Western  blot 
of  lysate  is  shown  on  the  left.  The  amount  of  Ajuba  immunoprecipitated  relative  to  input  was 
quantified  by  densitometry  and  controlled  for  the  amount  of  Snail  immunoprecipitated.  (Ajuba 
immunoprecipitated  with  full-length  Snail  was  arbitrarily  set  to  equal  1).  E.  Endogenous  Snail 
was  immunoprecipitated  from  lysates  of  HaCaT  or  MDA-231  cells  and  bound  products  Western 
blotted  for  the  presence  of  Ajuba,  LIMD1  and  Snail.  Controls  include  pulldown  with  Protein  G 
beads  alone  and  lysate  input. 

Figure  2.  Ajuba  interacts  with  Snail  in  the  nucleus.  A.  Confocal  immunofluorescence 
analysis  of  MCF-7  breast  cancer  cells  transfected  with  RFP- Ajuba  in  combination  with  GFP 
alone  or  GFP-Snail.  Arrows  indicate  cotransfected  cells.  B.  Quantification  of 
immunofluorescence  results.  Shown  are  the  percent  of  cells  in  which  RFP- Ajuba  is  localized  to 
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the  cytosol  (white  bars)  or  nucleus  (black  bars)  in  the  presence  of  GFP,  GFP-SnailWT  or  GFP- 
Snail8SA.  For  each  sample,  at  least  100  cells  were  counted.  The  experiment  was  repeated  three 
times  with  similar  results.  Shown  is  one  representative  experiment.  C.  Myc-Ajuba  was  stably 
overexpressed  in  HaCaT  epithelial  cells.  Endogenous  Snail  was  immunoprecipitated  from 
nuclear  extracts  (NE)  or  cytosolic  extracts  (CE)  of  these  cells  and  bound  products  Western 
blotted  for  the  presence  of  Ajuba  (anti-myc)  and  Snail.  Control  immunoprecipitation  was 
performed  with  rabbit  preimmune  sera  (PI).  Western  blot  of  input  controls  is  shown  on  the  left. 

Figure  3.  Ajuba  is  a  corepressor  of  Snail.  A.  Transient  luciferase  reporter  assay  using 
luciferase  driven  by  the  human  E-cadherin  promoter.  Constructs,  as  indicated,  were  expressed  in 
MCF-7  cells  and  luciferase  activity  determined  and  normalized  to  (3-galactosidase  activity  (from 
co-transfected  CMV-p-galactosidase  construct).  Experiments  were  performed  in  triplicate. 
Shown  are  mean  normalized  luciferase  values  +/-  standard  deviation.  B.  HEK293  cells  were  co¬ 
transfected  with  Flag- Ajuba,  HA-Snail,  and  myc-Ajuba  LIM  region  as  indicated.  Snail  was 
immunoprecipitated  from  lysates  (anti-HA)  and  bound  products  Western  blotted  for  full-length 
Ajuba  (anti-Flag),  Snail  (anti-HA),  and  Ajuba  LIM  region  (anti-myc).  The  amount  of  Ajuba 
immunoprecipitated  relative  to  input  was  quantified  by  densitometry  and  controlled  for  the 
amount  of  Snail  immunoprecipitated.  The  value  obtained  for  lane  4  was  arbitrarily  set  to  equal 
1.  C.  HEK293  cells  were  stably  transfected  with  vector  or  Flag-Snail.  Lysates  of  cells  were 
immunoblotted  for  E-cadherin,  Flag,  and  Ajuba.  Tubulin  is  a  loading  control.  D.  ChIPs  were 
performed  in  HEK293  cells  stably  transfected  for  vector  or  Flag-Snail  with  antibodies  to  Flag  or 
Snail  (top  panel)  and  Ajuba  (bottom  panel).  Preimmune  (PI)  sera  was  used  as  a  control.  PCRs 
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were  performed  using  primers  flanking  the  three  E-boxes  in  the  E-cadherin  promoter.  In 
schematic,  A,  B,  and  C  represent  E-boxes  on  the  E-cadherin  promoter. 

Figure  4.  Ajuba  and  LIMD1  specifically  affect  neural  crest  development  in  Xenopus.  A. 

X.  laevis  embryos  were  co-injected  with  (3-gal  and  either  mAjuba,  mLIMDl,  XSlug,  or  XSnail 
capped  mRNAs  in  one  cell  of  the  two-cell  stage  embryo.  Embryos  were  fixed  at  stage  18-19  and 
in  situ  hybridization  performed  for  XSlug,  XT wist,  or  XFoxD3.  Shown  are  dorsal  views  with 
anterior  end  oriented  up.  X-Gal  staining  shows  injected  region  of  the  embryo  (oriented  to  the 
right).  Numbers  in  the  lower  right-hand  corner  indicate  the  number  of  embryos  that  display  the 
phenotype  shown  over  the  number  of  embryos  injected.  B.  X.  laevis  embryos  were  co-injected 
with  (3-gal  and  either  mAjuba  or  mLIMDl  and  fixed  at  stage  12  (top)  and  14  (bottom)  and  in  situ 
hybridization  performed  for  Epikeratin  and  Sox2,  respectively.  Top  panels  are  in  side  view  with 
blastopore  oriented  left  and  bottom  panels  are  dorsal  views  with  anterior  oriented  up.  X-Gal 
staining  shows  injected  region  of  the  embryo. 

Figure  5.  The  LIM  region  of  Ajuba  and  LIMD1  block  neural  crest  development  in 
Xenopus.  A.  X.  laevis  embryos  were  co-injected  with  (3-gal  and  either  mAjuba  LIM  region  or 
mLIMDl  LIM  region  capped  mRNA,  fixed  at  stage  18-19,  and  in  situ  hybridized  for  XSlug  and 
XTwist.  Shown  are  dorsal  views  with  anterior  oriented  up.  X-Gal  staining  shows  injected 
region  of  the  embryo  (oriented  to  the  right).  Numbers  in  the  lower  right-hand  corner  show  the 
number  of  embryos  that  display  the  phenotype  shown  over  the  number  of  embryos  injected.  ND 
-  not  determined.  B.  X.  laevis  embryos  were  co-injected  with  (3-gal  and  combinations  of 
mLIMDl  LIM  region  +/-  full-length  mLIMDl  as  shown.  Embryos  were  fixed  at  stage  18-19 
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and  in  situ  hybridization  performed  for  Slug.  Graph  shows  percent  of  embryos  with  decreased 
Slug  (black  bars)  or  increased  Slug  (white  bars)  staining  on  the  injected  half  of  the  embryo. 

Figure  6.  The  LIM  region  of  Ajuba  or  LIMD1  specifically  blocks  Snail-  or  Slug-induced 
neural  crest  development  in  Xenopus.  X.  laevis  embryos  were  co-injected  with  (3-gal, 
mLIMDl  LIM  region,  and  either  XSlug  or  XSnail  capped  mRNA.  Embryos  were  fixed  at  stage 
18-19  and  in  situ  hybridization  performed  for  XSlug  and  XT wist.  Shown  are  dorsal  views  with 
anterior  oriented  up.  X-Gal  staining  shows  injected  region  of  the  embryo  (oriented  to  the  right). 
Numbers  in  the  lower  right-hand  corner  show  the  number  of  embryos  that  display  the  phenotype 
shown  over  the  number  of  embryos  injected. 

Figure  7.  X.  laevis  orthologs  of  the  Ajuba  and  Zyxin  families  of  LIM  proteins.  A  predicted 
phylogenetic  relationship  of  the  LIM  regions  of  the  C.  elegans,  Drosophila,  mouse  and  putative 
X.  laevis  orthologs  of  the  Ajuba/Zyxin  family  of  LIM  proteins.  *Lor  X.  laevis  WTIP,  only  the 
first  two  LIM  domains  of  the  LIM  region  have  been  identified. 

Figure  8.  Endogenous  XLIMD1  expression  pattern  overlaps  that  of  XSlug.  A.  In  situ 
hybridization  was  performed  on  uninjected  X.  laevis  embryos  using  sense  and  antisense  probes 
for  XLIMD1.  Staining  patterns  at  stages  19,  21,  and  24  are  shown.  Arrowheads  indicate 
XLIMD1  staining  in  the  premigratory  (Stage  19)  and  migratory  (Stages  21  and  24)  neural  crest. 
Embryos  are  oriented  with  dorsal  up  and  anterior  to  the  right.  B.  Double  in  situ  hybridization 
for  XLIMD1  and  XSlug  in  X.  laevis  embryos  at  the  stages  shown.  XLIMD1  localization  alone  is 
shown  in  blue  (left  column)  and  XSlug  is  overlayed  in  pink  on  right.  Arrows  indicate  areas  of 
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colocalization  in  neural  crest  derivatives.  Embryos  are  oriented  with  dorsal  up  and  anterior  to 
the  right. 

Figure  9.  XLIMD1  expression  results  in  an  increase  in  neural  crest  development.  X.  laevis 
embryos  were  co-injected  with  (3-gal  and  XLIMD1  capped  mRNA  in  one  cell  of  the  two-cell 
stage  embryo.  Embryos  were  fixed  at  stage  18-19  and  in  situ  hybridization  performed  for  XSlug, 
XT wist,  or  XFoxD3.  X-Gal  staining  (oriented  to  the  right)  shows  the  injected  region  of  the 
embryo.  Numbers  in  the  lower  right-hand  corner  indicate  the  number  of  embryos  that  display 
the  phenotype  shown  over  the  number  of  embryos  injected. 

Figure  10.  Model  of  Ajuba  protein  function  as  a  Snail  co-repressor.  A.  Ajuba  interacts  with 
Snail  bound  at  a  promoter  to  enhance  Snail-dependent  repression.  The  LIM  region  of  Ajuba 
LIM  proteins  direct  their  interaction  with,  predominantly,  the  SNAG  domain  of  Snail.  The 
preLIM  region  is  free  to  potentially  function  in  recruitment  of  repressor  complex  proteins  so  as 
to  allow  for  more  efficient  repression  by  Snail.  B.  In  the  presence  of  excess  LIM  region, 
endogenous  full-length  Ajuba  LIM  proteins  are  prevented  from  interacting  with  Snail.  As  a 
result,  Snail-dependent  repression  is  not  as  efficient. 
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SUMMARY 


Tandem  PHD  and  bromodomains  are  often  found  in  chromatin-associated  proteins  and  have 
been  shown  to  cooperate  in  gene  silencing.  While  both  domains  can  bind  specifically  modified 
histones,  the  mechanisms  of  cooperation  between  these  domains  are  unknown.  We  show  that  the 
PHD  domain  of  the  KAP1  corepressor  functions  as  an  intramolecular  E3  ligase  for  sumoylation 
of  the  adjacent  bromodomain.  The  RING  finger- like  structure  of  the  PHD  domain  is  required  for 
both  Ubc9  binding  and  sumoylation  and  directs  modification  to  specific  lysine  residues  in  the 
bromodomain.  Sumoylation  is  required  for  KAP1  mediated  gene  silencing  in  vivo  and  functions 
by  directly  recruiting  the  SETDB 1  histone  methyltransferase  and  the  CHD3/Mi2  alpha 
component  of  the  NuRD  complex  via  highly  conserved  SUMO  interacting  motifs  (SIMs). 
Furthermore,  sumoylated  KAP1  stimulates  the  histone  methyltransferase  activity  of  SETDB  1. 
Our  data  provide  a  novel  mechanistic  explanation  for  cooperation  of  PHD  and  bromodomains  in 
gene  regulation  and  describe  a  new  function  of  the  PHD  domain  as  an  E3  SUMO  ligase. 
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INTRODUCTION 


Eukaryotic  gene  regulation  requires  the  coordination  of  multiple  enzymatic  activities  that 
target  nucleosomal  histones  for  post-translational  modification.  Protein  domains  which  recognize 
specific  histone  modifications  are  key  components  of  this  mechanism  (Seet  et  al.,  2006).  The 
bromodomain  recognizes  acetylated  lysines  in  histone  H3  and  H4  (Yang,  2004).  The  PHD 
domain  of  ING2  and  BPTF  binds  methylated  histone  H3  (Li  et  al.,  2006;  Shi  et  al.,  2006; 
Wysocka  et  al.,  2006).  The  PHD  and  bromodomain  are  often  found  adjacent  to  each  other 
suggesting  that  they  may  form  a  cooperative  module  used  to  interpret  combinatorial 
modifications  of  histones.  Cooperation  of  the  PHD  and  the  bromodomain  occurs  in  p300,  TIP5, 
BPTF  and  KAP1  proteins  (Ragvin  et  al.,  2004;  Schultz  et  al.,  2001;  Wysocka  et  al.,  2006;  Zhou 
and  Grummt,  2005). 

KAP1/TIF1  beta  is  a  universal  corepressor  for  the  largest  family  of  transcriptional 
silencers  encoded  in  the  human  genome,  KRAB  domain  containing  zinc  finger  proteins  (ZFPs) 
(Huntley  et  al.,  2006).  The  KAP1  N-terminal  RBCC  region  is  responsible  for  KRAB  domain 
binding  (Friedman  et  al.,  1996;  Peng  et  al.,  2000),  while  HP1  binding  domain  and  PHD- 
Bromodomain  tandem  are  required  for  gene  silencing.  The  KAP1  PHD-Bromodomain  is 
required  for  recruitment  of  the  H3-K9  specific  HMTase  SETDB1  and  CHD3  proteins  to  the 
promoter  regions  of  KRAB  modulated  genes  (Schultz  et  al.,  2002;  Schultz  et  al.,  2001;  Sripathy 
et  al.,  2006).  Mutations  in  either  the  PHD  or  bromodomains  compromise  KAP1  interaction  with 
these  partner  proteins  and  relieve  repression,  suggesting  that  they  act  cooperatively.  The  PHD 
domain  of  KAP1  is  highly  related  to  the  RING  finger  (Capili  et  al.,  2001),  often  found  in 
ubiquitin  and  SUMO  E3  ligases  (Joazeiro  and  Weissman,  2000). 
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SUMO  family  proteins  are  conjugated  to  target  lysines  via  a  cascade  of  El  activating,  E2 
transfer  and  E3  ligase  enzymes  (Johnson,  2004).  The  SUMO  E2  protein  Ubc9  often  recognizes 
the  consensus  sequence  ipKxE/D  (ip  -  hydrophobic)  in  the  target  protein  and  catalyzes  SUMO 
conjugation  (Sampson  et  al.,  2001)  leading  to  alterations  in  protein  localization  and  function.  A 
family  of  deconjugation  enzymes,  SENP1-7,  is  responsible  for  rapid  removal  of  SUMO  from 
target  lysines  (Mukhopadhyay  and  Dasso,  2007)  which  accounts  for  the  very  temporal  nature  of 
this  modification.  Sumoylation  is  emerging  as  a  key  posttranslational  modification  involved  in 
transcriptional  repression  (Gill,  2005),  however,  the  detailed  mechanisms  of  its  action  remain 
obscure.  SUMO  modification  can  enhance  recruitment  of  repression  machinery,  including 
HDACs  (Girdwood  et  al.,  2003;  Yang  and  Sharrocks,  2004)  and  may  also  act  to  preclude  other 
lysine  modifications  leading  to  impaired  protein-protein  interactions. 

Here,  we  show  that  the  KAP1  PHD  domain  binds  to  Ubc9  and  directs  SUMO 
conjugation  of  an  adjacent  bromodomain,  and  that  sumoylation  is  required  for  KRAB-KAP1 
mediated  repression.  The  tandem  PHD-Bromodomain  of  KAP1  functions  as  an  intramolecular 
SUMO  E3  ligase.  SETDB 1  and  CHD3  encode  functional  SIM  motifs  and  only  bind  SUMO- 
modified  KAP1.  Thus,  sumoylation  mediated  repression  occurs  via  direct  recruitment  of  H3-K9 
HMTase  and  HD  AC  activities. 
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RESULTS 


KAP1  is  Sumoylated  In  Vivo  and  In  Vitro 

SUMO  is  often  conjugated  to  lysine  residues  within  the  consensus  rpKxE/D.  The  KAP1 
sequence  contains  six  lysines  conforming  to  this  consensus:  canonical  sites  at  K554  and  K676, 
and  four  reverse  sites  at  K575,  K750,  K779  and  K804  (Figure  1  A,B).  Immunoprecipitation  of 
endogenous  KAP1  with  two  different  antibodies  under  conditions  preserving  the  SUMO 
modification  revealed  a  ladder  of  SUMO  1 -modified  KAP1  species  (Figure  1C).  Expression  of 
SUMOl  and  Ubc9  or  a  dominant  negative  SENP1  greatly  enhanced  KAPl-SUMOl  conjugation, 
whereas  expression  of  wild  type  SENP1  prevented  it  (Figure  ID).  KAP1  sumoylation  was  also 
efficiently  reconstituted  in  vitro  with  purified  proteins.  Reaction  containing  all  components 
produced  a  ladder  of  slower  migrating  KAP1  forms  consistent  with  conjugation  of  multiple 
SUMOl  moieties.  Reactions  that  lacked  either  El  or  Ubc9  did  not  show  sumoylation  products 
(Figure  IE).  Thus,  KAP1  is  conjugated  with  SUMO  on  multiple  sites,  both  in  vivo  and  in  vitro. 

Mutation  of  the  PHD  Domain  Inhibits  KAP1  Sumoylation 

The  use  of  truncated  proteins  showed  that  SUMO  modification  occurs  only  in  the  C- 
terminal  half  of  KAP1  (NPB  fragment,  see  Figure  1F,G)  overlapping  the  known  repression 
domains:  NHD  (N-CoR2  Homology  Domain,  see  Figure  SI  A),  the  PHD  and  the  bromodomain. 
A  structural  analysis  of  the  KAP1  PHD  domain  showed  it  to  be  highly  related  to  the  RING  finger 
(Capili  et  al.,  2001),  often  found  in  ubiquitin  ligases  and  in  SUMO  E3  ligases  of  PIAS  family 
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(Joazeiro  and  Weissman,  2000).  The  RING  domain  in  PIAS  proteins  is  responsible  for 
interaction  with  Ubc9  and  its  structural  mutation  leads  to  inhibition  of  sumoylation  activity 
((Kotaja  et  al.,  2002)  and  Figure  3F).  To  test  the  hypothesis  that  the  PHD  domain  can  function  in 
a  similar  way  to  the  RING  finger  of  the  PIAS  ligases,  we  introduced  a  C651A  mutation,  which 
disrupts  its  structure  (Capili  et  al.,  2001).  SUMO  modification  of  the  C651A  mutant  protein  was 
drastically  reduced  (Figure  1F,G,  lane  4).  However,  the  intact  PHD  was  required  for  sumoylation 
of  many,  but  not  all  sites  in  KAP1  (Figure  1F,G,  arrow).  Sumoylation  of  the  NP  fragment  which 
lacks  the  bromodomain  was  unaffected  by  C651A  mutation  (Figure  1H).  Therefore,  mutation  in 
the  PHD  affected  sumoylation  of  the  bromodomain,  but  not  the  adjacent  NHD  domain  or  the 
PHD  itself.  These  results  suggest  that  the  PHD  domain  plays  a  key  role  in  KAP1  sumoylation 
and  it  is  required  to  target  the  adjacent  bromodomain. 

Selective  Utilization  of  Sumoylation  Sites  and  Their  Contribution  to  KAP1  Repression 
Activity 

First,  we  confirmed  putative  KAP1  sumoylation  sites  by  lysine-to-arginine  mutational 
analysis  and  assessed  the  contribution  of  sumoylation  to  the  repression  function  of  KAP1.  In 
agreement  with  our  previous  report  (Schultz  et  al.,  2001),  the  NHD,  PHD  and  bromodomain  all 
demonstrated  significant  repression.  Mutation  of  the  canonical  consensus  lysine  K554  in  the 
NHD  resulted  in  reduction  in  sumoylation  and  repression  (Figure  2A,B,  lane  2).  Mutation  of 
K575  modestly  reduced,  while  the  double  mutation  abolished,  both  sumoylation  and  repression 
(lane  4).  Mutation  of  K676  in  the  PHD  domain  had  a  similar  effect  (lane  6).  Simultaneous 
mutation  of  K779  and  K804  in  the  bromodomain  completely  abrogated  sumoylation  and 
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repression  (lanes  8-10).  Introduction  of  triple  substitutions  in  the  PB  (PHD-Bromodomain) 
module  revealed  an  additional,  cryptic  acceptor  lysine  K750  in  the  bromodomain  which  was  not 
active  in  repression  assay  in  the  absence  of  the  PHD  (lane  10)  and  occurs  in  a  non-canonical 
motif,  QEKL.  Additional  inactivation  of  this  fourth  site  completely  abrogated  repression  (Figure 
2C,  lane  7).  It  is  noteworthy  that  mutation  of  other  lysines  in  the  bromodomain,  K770  and  K774 
had  no  such  effect  (lane  8).  Analysis  of  mutant  PB  proteins  with  only  one  wild  type  acceptor 
lysine  (out  of  four)  allowed  us  to  evaluate  relative  contribution  of  each  site  to  sumoylation 
efficiency  and  repression  activity  (Figure  2C,D).  Fysines  K779  and  K804  were  predominant 
conjugation  sites  and  most  efficiently  recapitulated  repression  of  the  wild  type  protein.  Although 
SUMO  conjugation  to  K676  and  K750  was  much  less  efficient  we  confirmed  that  sumoylation  of 
these  sites  contributed  to  repression.  We  took  advantage  of  a  specific  inhibitor  of  the  endogenous 
sumoylation  machinery,  Garni,  which  targets  the  SUMO  El  for  degradation  (Boggio  et  al., 

2004).  When  each  mutant  was  cotransfected  with  the  wild  type  Garni,  the  repression  activity 
was  strongly  reduced  (Figure  2E),  suggesting  that  each  lysine  is  a  site  for  in  vivo  sumoylation 
and  that,  irrespective  of  other  modifications  which  may  occur  on  these  lysines,  sumoylation  has  a 
major  contribution  to  repression. 

Similar  analysis  of  the  NPB  fragment  of  KAP1  containing  all  six  acceptor  lysines 
allowed  us  to  establish  a  strict  hierarchy  in  the  site  utilization.  The  most  efficiently  sumoylated 
sites  were  K779,  K804  and  K554,  and  less  actively  utilized  -  K575,  K676  and  K750  (Figure 
2F,G).  Moreover,  the  difference  in  SDS-PAGE  mobility  of  SUMO-conjugated  NPB  mutants 
helped  us  to  assign  specific  bands  to  a  particular  acceptor  lysine  modification.  Thus,  sumoylation 
of  the  wild  type  NPB  fragment  revealed  one  major  mono-sumoylation  product  and  several  minor 
slower  migrating  bands  (lane  1).  This  predominant,  mono- sumoylated  NPB  protein  migrated  in 
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identical  position  to  the  SUMO  conjugates  at  the  bromodomain  sites  K779  or  K804  (lanes  6,7; 
see  also  Figure  SIB),  suggesting  that  sumoylation  of  KAP1  primarily  occurs  on  these  two  sites. 
Again,  use  of  mutant  NPB  proteins  with  only  one  wild  type  acceptor  lysine  (out  of  six)  showed  a 
very  good  correlation  between  the  extent  of  sumoylation  and  repression  activity  (Figure  2F-FI). 
Mutation  of  all  six  acceptor  lysines  almost  completely  abolished  sumoylation  and  repression 
(lane  8).  Thus,  sumoylation  of  KAP1  is  required  for  its  repression  function  and  plays  pivotal  role 
in  KAP1  mediated  silencing. 

The  KAP1  PHD  Binds  to  Ubc9  with  High  Selectivity  and  Directs  Sumoylation  of  the 
Adjacent  Bromodomain 

Next,  we  investigated  whether  the  PHD  domain  is  indeed  a  SUMO  E3  ligase.  The  PHD- 
Bromodomain  module  (PB)  was  a  >10-fold  better  substrate  for  sumoylation  and  better  repressor 
than  either  domain  alone  (Figure  2A,B,  compare  lanes  5,7  and  12).  Mutation  of  the  PHD 
(C651A)  greatly  diminished  SUMO  conjugation  to  the  bromodomain  sites  K779  and  K804  and 
impaired  repression  (Figure  3A,B,  lanes  5-7)  implying  that  PHD  enhances  bromodomain 
sumoylation. 

To  determine  if  the  PHD  domain  can  directly  associate  with  the  SUMO  E2  enzyme, 
Ubc9,  or  other  components  of  the  SUMO  conjugation  machinery  we  performed  GST  binding 
experiments  with  purified  proteins.  Recombinant  PHD  protein  bound  strongly  and 
stochiometrically  to  Ubc9  but  did  not  bind  to  SUMOl,  Sael  or  Sae2  (Figure  3C).  Structural 
mutations  in  the  PHD  domain,  C651A  or  W664A  completely  abolished  binding  to  Ubc9  (Figure 
3D,  lanes  3,6,9).  Interestingly,  the  PHD  domains  from  the  closely  related  TIF1/KAP1  family 
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members  TIFla  and  TIFly  as  well  as  from  the  chromatin  protein  SplOOC  were  unable  to  bind  to 
Ubc9  (Figure  S2A).  Moreover,  the  PHD-Bromodomain  (PB)  modules  from  TIFla,  TIFly  and 
SplOOC  were  very  poor  sumoylation  substrates  compared  to  the  KAP1  PB  (Figure  3E)  despite 
the  fact  that  they  contain  similar  sumoylation  consensus  sites  (Figure  S2B).  Thus,  the  correlation 
between  Ubc9  binding  and  sumoylation  provides  strong  evidence  that  KAP1  PHD-Ubc9 
interaction  is  critical  for  KAP1  SUMO  modification. 

To  compare  KAP1  with  the  known  SUMO  E3  ligase  PIAS1  we  analyzed  their 
sumoylation  efficiency  in  relation  to  the  structural  integrity  of  the  Ubc9  binding  determinants, 
PHD  and  RING  domains,  respectively.  Both,  RING  and  PHD  domains  were  poor  substrates 
themselves  (Figure  3F,  lanes  1  and  5).  However,  they  efficiently  stimulated  sumoylation  of 
adjacent  domains,  N-terminal  globular  domain  in  PIAS1  (RR)  and  bromodomain  in  KAP1  (PB), 
respectively  (lanes  3  and  7).  Structural  mutations  in  both  RING  and  PHD  domains,  which  disrupt 
Ubc9  binding,  drastically  reduced  PIAS1  and  KAP1  sumoylation  (lanes  4,8)  suggesting  that  the 
KAP1  PHD  domain  functions  similarly  to  the  PIAS1  RING  domain  and  acts  as  an  intramolecular 
SUMO  E3  ligase. 

To  distinguish  whether  the  KAP1  PHD  can  function  as  a  SUMO  ligase  in-trans  as  well  as 
in-cis,  we  tested  its  ability  to  stimulate  sumoylation  of  the  bromodomain  in  reactions  containing 
the  two  domains  as  separated,  individual  molecules.  Addition  of  increasing  amounts  of  PHD 
protein  to  the  bromodomain  or  PB  with  mutated  PHD  domain  did  not  result  in  increased  SUMO 
conjugation  (Figure  S2C).  However,  when  PHD  was  fused  to  the  C-terminus  of  the 
bromodomain  (Bromo-PHD  chimera)  sumoylation  was  significantly  enhanced  (Figure  3G, 
compare  lanes  2  and  4)  indicating  that  covalent  link  between  the  two  domains  is  required  for 
proper  SUMO  ligation. 
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Mapping  of  Ubc9  Binding  Site  on  the  KAP1  PHD  Domain 


To  map  interaction  interface  in  the  KAP1  PHD  involved  in  Ubc9  binding  we  performed 
NMR  titration  experiments.  As  shown  in  the  2D  'H-1sN  HSQC  (heteronuclear  single  quantum 
correlation)  NMR  spectra  of  the  15N-labeled  KAP1  PHD  domain  (Figure  4A),  many  amino  acid 
residues  of  the  protein  exhibited  significant  chemical  shift  perturbations  -  predominantly  line¬ 
broadening  effect  -  upon  addition  of  Ubc9  in  a  concentration  dependent  manner,  indicative  of 
direct  PHD-Ubc9  interactions.  The  ligand  binding  induced  line-broadening  effect  likely  results 
from  the  dissociation  rate  of  the  PHD-Ubc9  complex  on  the  millisecond  NMR  timescale, 
suggesting  a  low  micromolar  binding  affinity  of  the  two  proteins.  By  the  NMR  resonance 
assignments  (Figure  S3)  we  further  mapped  the  Ubc9  binding  site  on  the  protein  and  color-coded 
the  most  perturbed  amino  acid  residues  on  the  surface  representation  model  of  the  PHD  molecule 
(Figure  4B).  An  exhaustive  mutational  analysis  of  the  KAP1  PHD  domain  sequence,  which 
included  almost  every  amino  acid  substitution,  revealed  a  very  good  correlation  between 
decreased  in  vitro  sumoylation  efficiency  of  a  particular  missense  mutant  and  the  NMR 
resonance  perturbation  of  corresponding  residue  upon  Ubc9  titration  (Figure  S4  and  4C). 

Interestingly,  most  of  the  Ubc9  perturbed  residues  are  located  largely  on  one  side  of  the 
PHD  molecule  and  belong  to  both  zinc  coordination  sites,  as  well  as  to  loop  1  between  |3i  and  |>2 
strands,  loop  2  between  (T  and  [U  strands  and  to  an  adjacent  region  containing  H652  and  L653 
(Figure  4C)  at  the  PHD/bromodomain  interface  (see  accompanying  paper,  Zeng  et  al).  The  loop 
1  perturbed  residues  are  well  conserved  among  PHD  domains  of  TIF1  family  members,  whereas 
the  loop  2  and  [>4  strand  residues,  and  L653  are  unique  for  the  KAP1  PHD  (Figure  4D  and  S5A). 
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We  performed  subdomain  swaps  of  the  most  diverged  contiguous  amino  acid  sequences  between 
the  KAP1  and  TI F  I  7  PHD  domains  where  KAP1  PHD  segments  containing  loops  1,  2  or  3  were 
replaced  with  corresponding  TIFly  sequences  (6  aa  substitutions)  (Figure  4D).  The  respective 
mutants,  LI,  L2  or  L3  were  tested  in  GST  pull-down,  in  vitro  sumoylation  and  repression  assays 
(Figures  4E-G)  which  confirmed  that  the  L2  segment  of  the  KAP1  PHD  together  with  L653 
contain  the  critical  amino  acid  contacts/surfaces  required  for  binding  to  Ubc9.  We  verified  that 
the  L653A  and  L2  substitutions  did  not  cause  any  gross  changes  in  the  PHD  structural  fold  that 
were  observed  for  the  C651A  mutant  (Figure  S5B).  Together  these  results  provide  strong 
evidence  that  the  KAP1  PHD  directly  binds  to  Ubc9  and  can  function  as  a  SUMO  E3  ligase. 

Sumoylation  Is  Required  for  KRAB  Domain  Mediated  Repression 

Since  KAP1  is  an  obligate  corepressor  for  the  KRAB  domain,  we  first  sought  to 
determine  if  sumoylation  is  required  for  KRAB  domain  mediated  repression.  For  a  comparison, 
along  with  the  KRAB  several  other  well  characterized  repression  domains  were  tested.  LexA 
fusions  to  KRAB,  POZ,  SNAG  and  FOG  domains  were  cotransfected  with  the  Garni,  a  specific 
inhibitor  of  the  endogenous  sumoylation  pathway  (Boggio  et  al.,  2004).  Each  domain 
demonstrated  potent  repression  in  the  absence  of  Garni  (Figure  5A).  However,  the  repression 
activity  of  LexA-KAPl  PB  and  LexA-KRAB  was  markedly  reduced  by  wild  type  Garni  but  not 
the  Garni  mutant.  The  effect  of  Garni  on  repression  was  specific  for  the  KRAB-KAP1  pathway 
since  neither  POZ,  SNAG  nor  FOG  domains  showed  significant  relief  of  repression  in  the 
presence  of  Garni.  We  confirmed  that  expression  of  Garni  markedly  decreased  KAP1 
sumoylation  in  cells  (Figure  5B).  This  observation  emphasizes  the  importance  of  sumoylation  for 
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KRAB  domain  repression  function  and  sets  it  apart  from  mechanisms  of  action  of  the  other 
repressor  domains. 

Sumoylated  KAP1  is  Required  to  Complement  KRAB-mediated  Repression  in  KAP1 
Knockdown  Cells 

To  verify  that  sumoylation  of  KAP1  in  particular  is  required  for  KRAB  mediated 
repression  we  created  U20S  cell  lines  with  a  stable  knockdown  of  KAP1  using  vector-based 
shRNA  technology  (Wang  et  al.,  2005).  Clone  U20S-K4  showed  a  >95%  knockdown  of 
endogenous  KAP1  (Figure  S6A  and  5C).  Cotransfection  of  a  GAL4-TK-luciferase  and  Gal4- 
DBD  or  Gal4-KRAB  plasmids  into  control  U20S  cells  yielded  about  10-fold  repression, 
whereas  this  repression  activity  in  U20S-K4  cells  was  significantly  attenuated  (Figure  5C). 
Complementation  with  a  FLAG-KAP1  cDNA  resistant  to  the  action  of  the  shRNA  (Figure  S6B) 
restored  Gal4-KRAB  mediated  repression  to  18-fold.  In  contrast,  the  sumoylation  deficient 
mutant  of  KAP1,  K6R,  which  has  each  of  the  six  SUMO  acceptor  lysines  mutated  to  arginines, 
had  no  stimulatory  effect  on  Gal4-KRAB  repression  activity  (Figure  5D).  Both  wild  type  KAP1 
and  the  mutant  K6R  protein  were  properly  localized  to  pericentromeric  heterochromatin  (Figure 
S6C),  indicating  that  the  loss  of  repression  was  not  due  to  defect  in  the  K6R  subnuclear 
targeting.  The  KAP1  PHD  mutant,  C651A  which  showed  an  intermediate  level  of  sumoylation 
(Figure  1F,G)  demonstrated  poor  ability  to  complement  Gal4-KRAB-mediated  repression.  A 
fusion  of  the  K6R  protein  to  SUMOl  had  a  very  modest  ability  to  complement  KRAB  mediated 
repression  indicating  that  multiple  sumoylation  at  naturally  occurring  sites  in  KAP1  is  required 


13 


for  full  activity.  Altogether,  these  results  strongly  imply  that  KAP1  sumoylation  is  essential  for 
its  corepressor  function. 

SETDB1  and  CHD3  Interact  Only  with  Sumoylated  KAP1  in  Yeast 

Repression  mediated  by  the  KAP1  PHD-Bromodomain  (PB)  module  depends  on  its 
interaction  with  the  H3-K9  histone  methyltransferase  SETDB1  and  a  component  of  histone 
deacetylase  complex  -  chromatin  remodeling  factor  CHD3  (Schultz  et  al.,  2001;  Sripathy  et  al., 
2006).  We  asked  whether  these  interactions  are  influenced  by  sumoylation  of  KAP1.  Yeast  two 
hybrid  assay  showed  robust  interactions  of  wild  type  LexA-KAPl  PB  with  Gal4  activation 
domain  fusions  to  SETDB 1  or  CHD3  as  indicated  by  yeast  growth  under  the  appropriate 
selective  conditions  (Figure  6A).  However,  LexA-PB  with  the  K676R  mutation  interacted  with 
CHD3,  but  not  with  SETDB  1,  suggesting  that  sumoylation  of  K779  site  in  yeast  is  not  efficient 
enough  to  score  SETDB  1  binding.  Simultaneous  mutation  of  K676  and  K779  sites  abolished 
interaction  with  both  SETDB  1  and  CHD3.  Immunoprecipitation  and  Western  blot  analysis  of  the 
KAP1  bait  proteins  showed  that  only  K676  and  to  a  much  lesser  extent  K779  sites  were  subject 
to  sumoylation  in  yeast  (Figure  6B,  lanes  1-4).  Unlike  in  mammalian  cells,  the  yeast  SUMO 
protein,  Smt3p  formed  poly-Smt3p  chains  on  KAP1,  consistent  with  published  reports  ().  When 
the  LexA-KAPl  NPB  baits  were  analyzed,  we  observed  that  only  lysines  K554  and  K676  were 
required  for  association  with  SETDB  1  and  CHD3  (Figure  6A).  Among  the  six  identified 
sumoylation  sites  in  KAP1,  only  K554  and  K676  conform  to  the  canonical  consensus  (Figure 
1A)  suggesting  that  sumoylation  in  yeast  preferentially  occurs  on  canonical  sites.  These  results 
strongly  indicate  that  sumoylation  of  KAP1  is  required  for  binding  to  SETDB  1  and  CHD3. 
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Identification  of  Functional  SIM  Motifs  in  SETDB1  and  CHD3 


To  determine  if  binding  of  SETDB 1  or  CHD3  to  SUMO  may  be  sufficient  to  score 
genetic  interaction  with  sumoylated  KAP1,  we  used  LexA  fusions  to  human  SUMOl,  SUM02 
and  the  SUMOlmt  (K37A,K39A).  SETDB  1  and  CHD3  interacted  with  SUMOl  and  SUM02, 
but  not  with  the  SUMOlmt  (Figure  6D).  This  mutant  is  a  well-characterized,  two  amino  acid 
alteration  of  the  binding  pocket  of  the  SUMO  molecule  that  abolishes  its  interaction  with  SIM 
(SUMO  Interacting  Motif)  found  in  proteins  that  bind  SUMO.  We  hypothesized  that  SETDB  1 
and  CHD3  each  contain  SIM  motif  and  used  extensive  mutational  analysis  to  identify  it.  A  core 
SIM  with  the  consensus  sequence  V/I-X-V/I-V/I,  (X  -  any  amino  acid)  has  been  identified  and 
structurally  characterized  (Chupreta  et  al.,  2005;  Hecker  et  al.,  2006;  Song  et  al.,  2005).  Deletion 
of  23  aa  from  the  N-terminus  of  SETDB  1  fragment  abolished  yeast  growth  and  reporter  gene 
expression  (Figure  6C).  The  sequence  of  this  23  aa  peptide  encodes  a  4  aa  segment  (122-IIEI- 
125)  conforming  to  the  core  SIM  consensus  (Figure  6E).  When  this  putative  SIM  was  mutated 
the  interaction  with  KAP1  PB  was  abolished.  A  35  aa  C-terminal  truncation  of  CHD3  abolished 
its  interaction  with  KAP1  PB  (Figure  6C).  This  sequence  encodes  a  4  aa  segment  (1995-VICI- 
1998)  conforming  to  the  core  SIM  consensus.  When  the  six  C-terminal  residues  of  CHD3 
containing  this  putative  SIM  were  deleted  the  interaction  with  KAP1  PB  was  completely 
abrogated.  These  experiments  identified  bona  fide  SIM  domains  in  SETDB  1  and  CHD3.  In  both 
proteins,  the  SIM  domains  have  been  highly  conserved  throughout  vertebrate  evolution  (Figure 
S7A)  suggesting  that  they  play  an  important  functional  role. 
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SETDB1  and  CHD3  Bind  to  Sumoylated  KAP1  In  Vitro  in  a  SIM  Dependent  Manner 

To  confirm  that  these  interactions  are  direct,  we  reconstituted  them  in  vitro.  E.coli 
derived  GST  fusion  proteins  containing  a  high  proportion  (>50%)  of  properly  sumoylated  forms 
were  used  in  binding  assays  (Figure  6F,  bottom,  brackets).  SETDB1  showed  moderate  binding  to 
SUMOl  and  this  association  was  abolished  by  the  K37A,K39A  mutation  in  SUMOl 
(SUMOlmt)  (lane  4  and  Figure  S7B).  SETDB1  bound  poorly  to  unmodified  PB  but  the 
interaction  was  dramatically  stimulated  when  sumoylated  PB  (PB-S1)  or  sumoylated  full  length 
KAP1  (KAP1-S1)  were  used  (lanes  6,8).  SETDB1  which  lacks  the  SIM  motif  (SETDB  ldSIM) 
showed  markedly  reduced  binding  to  all  sumoylated  proteins  (middle).  SETDB1  showed  very 
little  affinity  to  highly  sumoylated  SplOOA  (Spl00-Sl,lane  10)  suggesting  that  the  SUMO 
modification  alone  is  not  the  sole  determinant  for  interaction  in  all  cases. 

We  observed  similar  results  in  reverse  binding  experiments  (Figure  6G).  Unmodified 
KAP1  did  not  bind  to  any  of  the  GST-SETDB1  or  GST-CHD3  resins  (lanes  2-7).  In  contrast, 
sumoylated  KAP1  was  efficiently  retained  by  wild  type  SETDB1  and  CHD3,  while  mutation  of 
their  SIM  motifs  completely  abolished  binding  (lanes  10-13).  Interestingly,  multi-sumoylated 
KAP1  conjugates  (higher  molecular  weight  bands)  bound  to  SETDB1  and  CHD3  more 
efficiently  than  mono-sumoylated  conjugates  despite  their  lower  abundance  in  the  preparation, 
providing  biochemical  evidence  to  explain  our  in  vivo  data  that  multi- sumoylation  of  KAP1  is 
required  for  efficient  KRAB-KAP1  mediated  repression.  Together,  these  data  indicate  that 
SUMO-SIM  interaction  provides  a  major  contribution  to  the  interaction  between  sumoylated 
KAP1  and  SETDB1  or  CHD3. 
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Endogenous  KAP1  binds  to  SETDB1  and  CHD3  in  a  Sumoylation  Dependent  Manner 

To  assess  the  role  of  sumoylation  for  KAP1-SETDB1/CHD3  interaction  in  vivo  we 
reintroduced  wild  type  or  different  functional  KAP1  mutants  into  our  U20S-K4  KAP1 
knockdown  cells.  Specifically,  we  selected  clonal  cell  lines  expressing  exogenous  FLAG-KAP1 
proteins  at  nearly  physiological  levels  (Figure  7A,  compare  lanes  1  and  4-8).  The  reconstituted 
KAP1  mutants  included  C2  (abolishes  trimerization/KRAB  binding)  (Peng  et  al.,  2000),  M2 
(abolishes  HP1  binding/subnuclear  targeting)  (Ryan  et  al.,  1999),  the  K6R  (abolishes 
sumoylation)  and  dB  (deletion  of  the  bromodomain,  decreases  sumoylation).  Cotransfection  of  a 
GAF4-TK-luciferase  and  Gal4-DBD  or  Gal4-KRAB  plasmids  into  K4/WT  cells  yielded  about 
17-fold  repression,  whereas  this  repression  activity  in  control  K4/vector  and  in  K4/C2,  K4/M2, 
K4/K6R  and  K4/dB  cells  was  significantly  attenuated  (top).  This  is  consistent  with  our  previous 
data  showing  that  KAP1  trimerization  (abolished  by  the  C2  mutation),  HP1  binding/subnuclear 
targeting  (abolished  by  the  M2  mutation)  and  sumoylation  (abolished  by  the  K6R  mutation, 
decreased  by  deletion  of  the  bromodomain  (dB))  (Figure  5D)  are  critical  for  KAP1  repression 
function.  Surprisingly,  in  addition  to  the  bona  fide  sumoylation  deficient  mutant,  K6R  all  the 
other  KAP1  functional  mutants  showed  dramatically  reduced  levels  of  SUMO  modification  in 
vivo  by  both,  SUMOl  and  SUM02  (Figure  7B,  middle,  lanes  14,  16-18)  suggesting  that 
impairment  in  sumoylation  might  be  the  major  factor  in  the  loss  of  their  repression  activity.  We 
detected  efficient  co-immunoprecipitation  of  endogenous  SETDB 1  and  Mi2/CHD3  with  wild 
type  KAP1  only  when  cells  were  lysed  and  processed  in  conditions  preserving  the  SUMO 
modification  (+NEM)  (top,  compare  lanes  15  and  21).  NEM  is  an  inhibitor  of  de-sumoylating 
SENP  enzymes,  which  deconjugate  SUMO  from  substrate  proteins  upon  cell  lysis  (lanes  7-12). 
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Consistent  with  this,  there  was  no  or  very  little  association  between  KAP1  and  Mi2/CHD3  or 
SETDB1  in  the  absence  of  NEM  (lanes  20-24).  Moreover,  markedly  reduced  sumoylation  of  the 
KAP1  mutants  resulted  in  lower  efficiency  of  SETDB1  and  Mi2/CHD3  co-immunoprecipitation 
(lanes  14,16-18).  Together  these  data  suggest  that  KAP1  sumoylation  plays  a  major  role  in  its 
interaction  with  SETDB 1  and  CHD3  in  vivo. 

KAP1  Sumoylation  is  Required  for  Recruitment  of  SETDB1  to  the  Target  Promoter 

To  confirm  this  conclusion  we  tested  the  requirement  of  KAP1  sumoylation  for  the 
recruitment  of  SETDB  1  to  target  promoter.  We  introduced  a  GAL4-TK-luciferase,  Gal4-DBD  or 
Gal4-KRAB  and  various  KAP1  variants  into  U20S-K4  KAP1  knockdown  cells  and  performed 
chromatin  immunoprecipitation  (ChIP)  assay  with  primers  specific  to  the  TK  promoter.  As 
expected,  all  Gal4  fusion  proteins  bound  to  the  promoter  (Figure  7C,  panel  a).  Consistently,  the 
wild  type  (WT),  K6R  and  M2  mutant  KAP1  proteins  were  efficiently  recruited  to  the  promoter 
by  Gal4-KRAB  but  not  by  Gal4-DBD  (panel  b,  lanes  5,7,9).  However,  SETDB  1  and  the  histone 
H3  K9-trimethyl  mark  were  only  enriched  at  the  promoter  in  cells  transfected  with  wild  type 
KAP1  or  control  vector  (panels  c,d,  lanes  3,5).  Interestingly,  since  HP1  binding  to  KAP1  does 
not  directly  depend  on  KAP1  sumoylation  (Ryan  et  al.,  1999),  HPla  was  recruited  to  the 
promoter  by  the  K6R  mutant  as  efficiently  as  by  wild  type  KAP1,  but  it  did  not  result  in 
appearance  of  the  histone  H3  K9-trimethyl  mark  and  subsequent  repression.  Conversely,  the 
HP  1 -binding  deficient  mutant,  M2  did  not  recruit  HP1  and  neither  SETDB  1  nor  the  histone  H3 
K9-trimethyl  mark  (panels  c-e,  lane  9),  since  it  is  also  defective  in  sumoylation  (Figure  7B,  lane 
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17).  Together,  these  results  suggest  that  KAP1  sumoylation  is  required  for  the  SETDB1 
recruitment  to  the  target  promoter  and  the  establishment  of  repressive  chromatin  marks. 

Sumoylated  KAP1  Stimulates  SETDB1  HMTase  Activity  in  Vitro 

To  investigate  whether  sumoylated  KAP1  influences  SETDB1  enzymatic  activity,  we 
performed  in  vitro  histone  methyltransferase  (HMT)  assay  in  the  presence  of  unmodified  or 
sumoylated  KAP1  PB  module  (see  Figure  S8A).  Low  levels  (100  ng)  of  baculovirus-expressed 
SETDB1  were  incubated  with  core  histones  in  a  standard  HMTase  reaction  containing  increasing 
amounts  of  KAP1  PB.  Under  these  conditions,  SETDB1 -mediated  methylation  of  histone  H3 
was  undetectable  in  the  absence  of  KAP1  (Figure  7D,  lane  2).  (A  twenty  fold  higher  amount  of 
SETDB 1  (2  pg)  was  required  to  produced  robust  methylation  of  the  histone  H3  (data  not 
shown)).  When  unmodified  KAP1  PB  was  added  to  the  reaction,  a  low  level  of  H3  methylation 
was  detected  (lane  5).  However,  the  addition  of  sumoylated  KAP1  PB  resulted  in  a  robust,  dose- 
dependent  increase  in  histone  H3  methylation  (lanes  6-8),  suggesting  that  sumoylated  KAP1, 
which  binds  SETDB  1,  enhanced  its  catalytic  activity.  Similar  titration  reactions  with  SUMOl 
protein  alone  did  not  result  in  stimulation  of  SETDB  1  (Figure  S8B)  suggesting  that  KAP1  and 
SUMO  sequences  together  contribute  to  SETDB  1  activation.  In  summary,  these  data  strongly 
indicate  that  sumoylated  KAP1  can  increase  SETDB  1  HMTase  activity  toward  histone  H3. 
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DISCUSSION 


KAP1  PHD  is  an  Intramolecular  SUMO  E3  Ligase  for  the  Adjacent  Bromodomain 

Tandem  PHD  and  bromodomains  are  often  found  in  chromatin  associated  proteins  and 
have  been  shown  to  cooperate  in  gene  regulation.  Based  upon  recent  structural  analyses,  it  has 
been  proposed  that  tandem  PHD  and  bromodomains  can  independently  recognize  modifications 
in  histones,  and  thus  may  act  combinatorially  to  interpret  the  histone  code  (Li  et  al.,  2006).  In 
this  article  we  provide  an  alternative  basis  for  the  cooperativity  of  these  domains  in  corepressor 
protein  KAP1  by  demonstrating  that  the  PHD  domain  can  function  as  a  SUMO  E3  ligase  in-cis 
for  the  bromodomain. 

We  found  that  the  PHD  domain  of  KAP1  functions  in  a  similar  way  to  the  RING  finger 
of  the  PIAS  SUMO  E3  ligases.  It  directly  binds  to  the  SUMO  E2  enzyme  Ubc9  and  is  necessary 
for  KAP1  sumoylation.  The  immediate  target  of  the  KAP1  PHD  ligase  activity  is  the  adjacent 
bromodomain.  In  addition  to  binding  to  the  E2  enzyme  a  true  E3  ligase  has  to  fulfill  the  second 
criteria  -  interact  with  its  target.  The  solution  structure  of  the  KAP1  PHD-Bromodomain  tandem 
confirmed  that  these  two  domains  physically  interact  and  form  a  structural  and  functional 
module  (Zeng  et  al.,  see  accompanying  paper).  This  interaction  seems  to  be  important  since 
chimeras  between  KAP1  PHD  and  bromodomains  from  other  TIF1  family  members  fail  to  fully 
recapitulate  repression  activity  (Schultz  et  al.,  2001). 

It  seems  likely  that  the  KAP1  PHD  may  also  function  as  a  SUMO  E3  not  only 
intramolecularly  but  also  in-trans,  by  enhancing  sumoylation  of  other  KAP1  interacting  proteins. 
Interestingly,  we  have  not  detected  KAP1  dependent  stimulation  of  SETDB1  and  CHD3  protein 
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sumoylation  implying  that  they  are  effector  molecules  for  KAP1  but  are  not  direct  targets  for  the 
-E2-E3  (Ubc9-PHD)  cascade. 

Sumoylation  Dependent  Interaction  of  KAP1  with  Its  Effector  Proteins 

It  has  been  demonstrated  that  controlled  recruitment  of  a  Gal4-fusion  of  KAP1  to  a 
chromatinized  GAL4-TK-luciferase  transgene  results  in  targeted  localization  of  SETDB 1  to  the 
promoter  region,  a  local  increase  in  histone  H3-K9  tri-methylation  and  repression.  Transient 
knockdown  of  SETDB  1  in  these  conditions  compromises  repression  and  thus  provides  evidence 
for  functional  link  between  KAP1  repression  activity  and  SETDB  1  recruitment  (Sripathy  et  al., 
2006).  Here  we  provide  direct  evidence  that  the  interaction  between  KAP1  and  SETDB  1  or 
CHD3  depends  on  KAP1  sumoylation  and  is  a  function  of  their  specific  SIM  motifs.  We  show 
that  SETDB  1  and  CHD3  are  able  to  interact  with  SUMOl  and  SUM02.  Consistent  with  this 
finding,  SETDB  1  was  recently  identified  among  other  bound  proteins  in  GST-SUM02  affinity 
chromatography  purifications  (Rosendorff  et  al.,  2006).  It  appears  that  specific  recognition  of 
SUMO-conjugated  transcriptional  regulators  by  individual  components  of  repression  machinery 
could  be  a  general  phenomenon.  It  has  recently  been  shown  that  another  partner  protein  of 
SETDB  1,  MCAF1,  contains  SIM  domain  which  facilitates  its  association  with  sumoylated 
MBD1  (Uchimura  et  al.,  2006).  Similarly,  recruitment  of  the  DAXX  protein  by  SUMO-modified 
glucocorticoid  receptor  is  dependent  on  its  SIM  domain  (Lin  et  al.,  2006). 

Our  discoveries  are  summarized  in  a  schematic  model  describing  the  mechanism  of 
KRAB-KAP1  mediated  repression  (Figure  7D).  Like  other  SUMO  substrates,  it  appears  that  only 
a  small  percentage  of  total  cellular  KAP1  may  be  SUMO-modified  at  any  one  time.  We  favor  the 
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hypothesis  that  sumoylation  of  KAP1  occurs  in  a  controlled,  transient  fashion  only  at  specific 
sites  in  chromatin,  for  example  upon  KRAB-ZFP-KAP1  interaction  at  the  target  gene  promoters. 
The  multiple  KAP1-SUMO  species  observed  in  vivo  suggest  that  KAP1  sumoylation  occurs  on 
many  sites  simultaneously  with  the  major  ones  located  in  the  bromodomain.  The  PHD-Ubc9 
interaction  is  required  for  this  modification  and  potentially  could  also  be  regulated.  Once 
modified  by  SUMO,  KAP1  serves  as  a  scaffold  protein  and  recruits  repression  machinery 
through  the  recognition  of  the  conjugated  SUMO  moieties  by  the  SIM  motifs  of  CHD3  and 
SETDB1,  which  recruit  their  associated  proteins.  Moreover,  the  KAP1-SETDB1  interaction  not 
only  results  in  recruitment  of  SETDB1  but  also  stimulates  the  HMTase  activity  of  the  enzyme. 
Ultimately,  action  of  these  effector  molecules  leads  to  chromatin  reorganization  at  the  promoter 
region,  i.e.  histone  deacetylation  and  methylation  of  histone  H3-K9.  Notably,  KAP1  scaffolding 
function  is  further  exploited  through  recruitment  of  HP1.  HP1  recognizes  the  H3-K9  methyl 
mark  and  participates  in  establishment  of  repressive  chromatin  state. 

In  summary,  this  study  provides  conclusive  evidence  that  the  KAP1  PHD  finger  acts  as 
an  intramolecular  SUMO  E3  ligase  for  the  adjacent  bromodomain  and  sumoylation  of  KAP1  as  a 
major  regulatory  switch  required  for  KRAB-KAP1  mediated  repression. 
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FIGURE  LEGENDS 


Figure  1.  KAP1  is  Sumoylated  in  Vivo  and  in  Vitro  in  a  PHD-Dependent  Manner 

(A)  Sequence  alignment  of  six  KAP1  sumoylation  sites.  Forward  and  reverse  sumoylation 
consensuses  are  shown  and  boxed. 

(B)  Schematic  representation  of  KAP1  structural  domains.  B1  -  box  1,  B2  -  box  2,  CC  -  coiled- 
coil,  HP1BD  -  HP1  binding  domain.  Amino  acid  positions  of  KAP1  sumoylation  site  lysines  and 
various  domain  inactivating  mutations  are  indicated.  Deletion  mutants  of  KAP1  utilized  are 
abbreviated  with  the  first  letters  of  each  domain  (PB  =  PHD-Bromodomain).  Sumoylation 
deficient  mutant  with  all  six  sumoylation  site  lysines  substituted  to  arginines  is  designated  RR-R- 
RRR  or  K6R. 

(C)  Proteins  were  immunoprecipitated  from  HI 299  cells  with  pre-immune  serum  or  antibodies 
specific  for  different  regions  of  KAP1 :  PB  or  RBCC;  or  an  unrelated  LimDl  antibody.  A 
SUMOl  antibody  was  used  in  Western  blot.  Sumoylated  forms  of  KAP1  are  indicated  with 
arrows. 

(D)  HEK293  cells  were  transfected  with  empty  vector,  Ubc9,  SENP1  or  dnSENPl  constructs 
with  or  without  HA-SUMO  1  plasmid  as  indicated.  Proteins  were  detected  by  Western  blot  with 
KAP1  antibody.  Sumoylated  form  of  KAP1  is  indicated  with  an  arrowhead. 
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(E)  In  vitro  sumoylation  of  GST-KAP1.  Immobilized  GST  and  GST-KAP1  were  incubated  in 
the  presence  or  absence  of  recombinant  El,  Ubc9,  and  "P-labeled  SUMOl  as  indicated.  The 
reaction  products  were  analyzed  by  coomassie  blue  staining  (bottom)  and  autoradiography  (top). 


(F)  HEK293  cells  were  transfected  with  the  indicated  FLAG-KAP1  and  T7-SUM01  plasmids. 
Proteins  were  analyzed  by  Western  blot  with  FLAG  antibody. 

(G-H)  In  vitro  sumoylation  of  indicated  GST-KAP1  proteins.  The  major  mono-sumoylated 
forms  of  KAP1  are  indicated  with  arrowheads.  Arrows  indicate  the  position  of  major  PHD- 
independent  SUMO  conjugates. 


Figure  2.  Mapping  Sumoylation  Sites  and  Repression  Domains  in  KAP1 

(A)  In  vitro  sumoylation  of  GST  fusions  of  indicated  KAP1  domains  wild  type  or  various 
sumoylation  site  mutants  was  performed  as  in  Fig.  IE.  The  reaction  products  were  analyzed  by 
coomassie  blue  staining  (bottom)  and  autoradiography  (top).  Arrowheads  and  arrows  indicate  the 
positions  of  mono-  and  di-sumoylated  forms,  respectively. 

(B)  HEK293  cells  were  transfected  with  plasmids  expressing  the  same  KAP1  domains  as  in  (A) 
fused  to  LexA,  together  with  4xLexA-TK-luc  plasmid  and  a  pCMV-|3-gal  normalization  vector. 


27 


(C)  HEK293  cells  were  transfected  with  indicated  LexA-PB  plasmids,  4xLexA-TK-luc  plasmid 
and  a  pCMV-|3-gal  normalization  vector.  The  expression  of  LexA-PB  proteins  was  confirmed 
by  Western  blot  (bottom). 

(D)  HEK293  cells  were  transfected  with  T7-SUM01  and  indicated  LexA-PB  plasmids.  LexA- 
PB  proteins  were  immunoprecipitated  with  LexA  antibody  and  analyzed  by  Western  blot  with 
SUMOl  (top)  and  LexA  antibodies  (bottom).  Asterisk  -  IgGs. 

(E)  HEK293  cells  were  transfected  with  indicated  LexA-PB  plasmids  and  wild  type  or  mutant 
Garni  together  with  4xLexA-TK-luc  plasmid  and  a  pCMV-|3-gal  normalization  vector.  The 
expression  of  LexA-PB  and  Myc-Gaml  proteins  was  confirmed  by  Western  blot  (bottom). 

(F)  In  vitro  sumoylation  of  indicated  GST-NPB  KAP1  proteins  was  performed  as  in  Pig.  IE.  The 
reaction  products  were  analyzed  by  coomassie  blue  staining  (bottom)  and  autoradiography  (top). 

(G)  HEK293  cells  were  transfected  with  HA-SUMO  1  and  indicated  LexA-NPB  plasmids.  LexA- 
NPB  proteins  were  immunoprecipitated  with  LexA  antibody  and  analyzed  by  Western  blot  using 
HA  (top)  and  KAP1  antibodies  (bottom). 

(H)  HEK293  cells  were  transfected  with  the  same  LexA-NPB  plasmids  as  in  (G)  together  with 
4xLexA-TK-luc  plasmid  and  apCMV-|3-gal  normalization  vector.  The  expression  of  LexA- 
NPB  proteins  was  confirmed  by  Western  blot  (bottom). 

(B,C,E,H)  Data  are  the  mean  +  SD  of  at  least  two  experiments  performed  in  duplicate. 
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Figure  3.  The  KAP1  PHD  Binds  to  Ubc9  with  High  Selectivity  and  Directs  Sumoylation  of 
the  Adjacent  Bromodomain 

(A)  In  vitro  sumoylation  of  GST-PB  wild  type,  indicated  sumoylation  site  mutants  and  C651A 
mutant  proteins.  Arrowheads  and  arrows  indicate  the  positions  of  mono-  and  di-sumoylated 
forms,  respectively. 

(B)  HEK293  cells  were  transfected  with  indicated  LexA-PB  plasmids  together  with  4xLexA-TK- 
luc  plasmid  and  a  pCMV-[3-gal  normalization  vector. 

(C)  GST  and  GST  fusions  of  SUMOl,  Ubc9,  Sael  and  Sae2  were  immobilized  on  glutathione 
beads  and  incubated  with  recombinant  KAP1  PHD.  Bound  proteins  were  analyzed  by  coomassie 
blue  staining.  Lane  1  was  loaded  with  a  protein  marker  in  addition  to  an  input  sample  of  the  PHD 
protein. 

(D)  GST  and  GST-Ubc9  were  immobilized  on  glutathione  beads  and  incubated  with  the 
indicated  recombinant  KAP1  PHDs.  Bound  proteins  were  analyzed  by  coomassie  blue  staining. 

(E)  In  vitro  sumoylation  of  GST-PB  modules  from  KAP1,  TIFla,  TIFly  and  SplOOC  was 
performed  as  in  Fig.  IE.  The  reaction  products  were  analyzed  by  coomassie  blue  staining 
(bottom)  and  autoradiography  (top). 
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(F)  In  vitro  sumoylation  of  indicated  GST  fusion  proteins:  RING  or  RING  with  adjacent  domain 
(RR)  from  mouse  PIAS1  (aa  140-320),  PHD  or  PB  from  KAP1,  was  performed  as  in  Fig.  IE.  The 
reaction  products  were  analyzed  by  coomassie  blue  staining  (bottom)  and  autoradiography  (top). 

(G)  In  vitro  sumoylation  of  GST-PHD-Bromo  and  GST-Bromo-PHD  chimera  was  performed  as 
in  Fig.  IE.  The  reaction  products  were  analyzed  by  coomassie  blue  staining  (bottom)  and 
autoradiography  (top). 


Figure  4.  Mapping  of  Ubc9  Binding  Site  on  the  KAP1  PHD  Domain 


(A)  2D  'H-1sN  HSQC  spectral  comparison  of  15N-labeled  KAP1  PHD  domain  in  the  free  form 
(black)  and  in  the  presence  of  Ubc9  (red).  Molar  ratio  of  KAP1  PHD  finger  to  Ubc9  is  1:2.4. 
(Inset)  Expansion  of  spectral  region  illustrating  three  peaks  that  undergo  line  broadening  during 
Ubc9  titration.  The  blue  peaks  correspond  to  a  mid-point  of  the  titration,  in  which  PHD:Ubc9 
molar  ratio  is  1:1.2. 

(B)  Ribbon  and  surface  representations  of  the  KAP1  PHD  domain  structure  (PDB  code  1FP0), 
highlighting  the  residues  that  exhibited  major  resonance  perturbations  upon  Ubc9  binding.  KAP1 
residues  corresponding  to  NMR  peaks  that  undergo  line  broadening  at  a  PHD:Ubc9  molar  ratio 
of  1:1.2  are  in  red,  and  those  affected  at  1:1.8  are  in  orange. 


(C)  Sequence  alignment  of  PHD  domains  from  KAP1  and  its  closest  human  PHD  homologue 
BPTF  as  well  as  from  TIF1  family  proteins  and  SplOOC.  Residues  are  color-coded  according  to 
ClastalW  color  scheme.  The  KAP1  PHD  residues  perturbed  in  NMR  titration  experiment  (A,B) 
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are  shaded  in  yellow,  and  aa  substitutions  which  showed  inhibitory  effect  in  sumoylation  assay 
(Figure  S3)  are  shown  underneath.  Chelating  cysteine  and  histidine  residues  are  shaded  in  grey. 
Residues  of  two  (3-turn- (3  units  of  anti-parallel  (3-sheet  are  shaded  in  cyan. 

(D)  Sequence  alignment  of  PHD  domains  from  KAP1  and  TIFly.  The  6  aa  substitutions  of 
predicted  loop  segments  LI,  L2  and  L3  are  shown  above.  Identical  residues  are  shaded  in  grey 
(chelating)  and  cyan. 

(E)  GST  and  GST-Ubc9  were  immobilized  on  glutathione  beads  and  incubated  with  the 
indicated  recombinant  KAP1  PHDs.  Bound  proteins  were  analyzed  by  coomassie  blue  staining. 

(F)  In  vitro  sumoylation  of  GST-PB  wild  type  or  indicated  loop  mutants  was  performed  as  in 
Fig.  IE.  The  reaction  products  were  analyzed  by  coomassie  blue  staining  (bottom)  and 
autoradiography  (top). 

(G)  U20S  K4  cells  were  transfected  and  treated  as  in  Fig.5C  except  that  empty  vector,  FLAG- 
KAP1  wild  type  or  indicated  loop  mutant  plasmids  were  included  into  the  transfection  mixture. 


Figure  5.  Role  of  KAP1  Sumoylation  in  KRAB  Domain  Mediated  Repression 

(A)  HEK293  cells  were  transfected  with  plasmids  encoding  LexA  fusions  of  indicated  repression 
domains  together  with  empty  vector,  wild  type  or  mutant  Garni,  and  4xLexA-TK-luc  plasmid 
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with  a  pCMV-p-gal  normalization  vector.  The  expression  of  the  LexA  fusion  proteins 
(arrowheads)  was  confirmed  by  Western  blot  (bottom). 

(B)  HEK293  cells  were  transfected  with  FLAG-KAP1  and  100  ng  of  T7-SUM01  plasmid 
together  with  empty  vector,  wild  type  or  mutant  Garni  plasmids.  Lysates  were  Western  blotted 
with  Myc  antibody  to  detect  Garni  protein  (bottom).  Asterisk  -  non-specific  band.  KAP1  was 
immunoprecipitated  with  FLAG  M2  antibody  and  Western  blotted  with  FLAG  (lanes  1-3), 
SUMOl  (lanes  4-6)  and  SUM03  (lanes  7-9)  antibodies. 

(C)  U20S-G1  and  -K4  cells  were  transfected  with  Gal4-DBD  or  Gal4-KRAB  plasmids  together 
with  5xGal4-TK-luc  plasmid  and  apTK-[3-gal  normalization  vector.  Endogenous  KAP1  protein 
expression  level  was  verified  by  Western  blot  (bottom). 

(D)  U20S-K4  cells  were  transfected  and  treated  as  in  (C)  except  that  empty  vector,  FLAG- 
KAP1  wild  type  or  indicated  mutant  plasmids  were  included  into  the  transfection  mixture  (top). 
The  expression  of  FLAG-KAP1  proteins  was  confirmed  by  Western  blot  (bottom). 

(A,C,D)  Data  are  the  mean  +  SD  of  at  least  two  experiments  performed  in  duplicate. 


Figure  6.  SETDB1  and  CHD3  Interact  with  KAP1  in  Yeast  and  in  Vitro  in  SIM  Dependent 
Manner 
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(A,C,D)  Growth  selection  and  quantitative  (3-gal  analyses  of  L40a  yeast  cells  transformed  with 
indicated  plasmids.  The  growth  was  scored  on  a  three-point  scale:  (+++)  -  robust  growth,  (++)  - 
slow  growth  and  (-)  -  no  growth  on  triple  drop-out  plates  (right). 

(B)  L40a  cells  transformed  with  LexA-PB  and  -NPB  constructs  as  in  (A)  were  grown  in  double 
drop-out  media.  LexA  fusion  proteins  were  immunoprecipitated  with  LexA  antibody  and 
analyzed  by  Western  blot  with  Smt3p  (top)  and  LexA  antibodies  (bottom). 

(C)  Schematic  representation  of  SETDB1  and  CHD3  domain  structure:  T1  and  T2  -  Tudor 
domains,  P  -  PHD,  C  -  chromodomain.  SIM  is  indicated  by  a  stripped  box. 

(E)  Sequence  alignment  of  the  SUMO  interacting  motifs  (SIM)  from  SETDB1  and  CHD3.  The 
SIM  consensus  is  boxed  and  the  flanking  acidic  residues  are  italicized. 

(F)  Indicated  GST  fusion  proteins  immobilized  on  glutathione  beads  were  incubated  with  "  S- 
labeled  SETDB1  or  SETDBldSIM.  Bound  proteins  were  analyzed  by  coomassie  blue  staining 
(bottom)  and  autoradiography  (top).  Brackets  indicate  positions  of  sumoylated  protein  forms. 

(G)  GST  and  GST  fusions  of  SETDB1,  SETDB1  SIMmt  (aa  113-272),  CHD3  (aa  1782-2000), 
CHD3dSIM  (aa  1782-1994)  and  CHD4  (aa  1768-1912)  immobilized  on  glutathione  beads  were 
incubated  with  recombinant  unmodified  or  sumoylated  full  length  KAP1.  Bound  proteins  were 
analyzed  by  coomassie  blue  staining.  Lanes  1  and  8  contain  input  samples. 
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Figure  7.  KAP1  Sumoylation  Promotes  Recruitment  of  SETDB1  to  the  Target  Promoter 
and  Stimulates  Its  Enzymatic  Activity 

(A)  U20S-G1,  -K4  and  indicated  reconstituted  K4/FLAG-KAP1  cell  lines  were  treated  as  in 
Fig.5C  (top).  The  expression  of  KAP1,  p53  and  HPla  proteins  was  confirmed  by  Western  blot 
(bottom). 

(B)  Proteins  from  indicated  reconstituted  U20S-K4/FLAG-KAP1  cell  lines  were 
immunoprecipitated  with  FLAG  M2  resin  in  the  presence  or  absence  of  5  mM  NEM  and 
Western  blotted  with  Mi2/CHD3,  SETDB1,  SUMOl,  SUM02  and  FLAG  antibodies. 

(C)  ChIP  analysis  of  KAPl-mediated  SETDB1  and  HPla  recruitment  to  the  target  TK  promoter. 
U20S-K4  cells  were  transfected  with  Gal4-DBD  or  Gal4-KRAB,  empty  vector,  FLAG-KAP1 
wild  type  or  indicated  mutant  plasmids  together  with  5xGal4-TK-luc  plasmid. 

(D)  Increasing  amounts  of  unmodified  or  sumoylated  KAP1  PB  were  mixed  with  100  ng  of 
SETDB1  along  with  5  pg  of  core  histones  and  subjected  to  an  in  vitro  HMTase  assay. 
Autoradiograph  shows  corresponding  H-me thy  1- labeled  histone  H3.  Western  blotting  confirms 
presence  of  SETDB1  and  KAP1  PB  proteins  in  the  HMTase  reaction.  Arrowhead  indicates  the 
position  of  mono- sumoylated  PB. 
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(E)  KRAB-ZFP  bound  to  the  promoter  of  its  target  gene  interacts  with  corepressor  KAP1  via  its 
N-terminal  RBCC  region.  KAP1  PHD  domain  functions  as  an  intramolecular  E3  ligase  by 
binding  to  Ubc9  (blue  arrow)  and  directing  sumoylation  of  the  adjacent  bromodomain.  The 
SUMO-conjugated  bromodomain  recruits  the  CHD3/NuRD  complex  and  SETDB1  through 
SUMO-SIM  interaction,  which  results  in  the  deacetylation  of  histones  and  the  methylation  of 
histone  H3-K9.  KAP1  -bound  HP1  recognizes  H3-K9  methylation  via  its  chromodomain  (CD) 
and  is  deposited  onto  the  promoter  in  order  to  silence  gene  expression.  Small  triangle  -  acetyl 
mark,  small  circle  -  H3-K9  tri-methyl  mark  on  histone  tails. 
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Fig.1  KAP1  is  Sumoylated  In  Vivo  and  In  Vitro  in  a  PHD-Dependent  Manner 
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Fig.2  Mapping  Sumoylation  Sites  and  Repression  Domains  in  KAP1 
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Figure  2 


Fig. 3  The  KAP1  PHD  Binds  to  Ubc9  with  High  Selectivity  and  Directs 
Sumoylation  of  the  Adjacent  Bromodomain 
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Fig.4  Mapping  of  Ubc9  Binding  Site  on  the  KAP1  PHD  Domain 
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Figure  4 


Fig. 5  Role  of  KAP1  Sumoylation  in  KRAB  Domain  Mediated  Repression 


LexA-Domain 


20 


15 


.  10 


Z  5 


kDa 

50 

40 

26 


I 


I 


■  vector 
a  Garni  WT 
□  Garni  mt 


III 


t 


i 


i* 


1  2  3 

4  5  6 

7  8  9 

10  11  12 

13  14  15 

16  17  18 

DBD 

PB 

KRAB 

POZ 

SNAG 

FOG 

► 

◄ 

—  ◄ 

►  — - 

WB:  LexA 


B 

FLAG-KAP1,  IP:  FLAG 

Myc-Gaml:  —  WT  mt  —  WT  mt  —  WT  mt 
kDa -  -  - 


WB:  FLAG 


50 


* 

◄—Garni 


Lysate,  WB:  Myc 
1  2  3 


C  D 


Gal4-KRAB 


Gal4-KRAB  +  FLAG-KAP1 


20 


15 


10 


■i:  5 


kDa 


WT  C651A  K6R  S1-K6R  :FLAG- 
i  KAP1 


o 

CO 

< 

Q 

00 

1 - 

n 

00 

< 

1 - 

Q 

- 1 

CO 

<r 

1 — 

o 

— 1 

00 

< 

1 — 

o 

— 1 

CO 

$ 

1 — 

o 

— 

00 

< 

CO 

oc 

CO 

:Gal4  kDa 

00 

(Y 

00 

Q0 

00 

(2 

00 

00 

Q 

o 

o 

XI 

Q 

Q 

o 

o 

MM 

120 

120 

WB:  KAP1 
12  3  4 


WB:  FLAG 

4  5  6  7  8  9  10 


Ivanov  A.  et  al 


Figure  5 


SUMO- 

KAP1 


Fig. 6  SETDB1  and  CHD3  Interact  with  Sumoylated  KAP1  in  Yeast  and 
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Fig. 7  KAP1  Sumoylation  Promotes  Recruitment  of  SETDB1  to  the  Target 
Promoter  and  Stimulates  Its  Enzymatic  Activity 
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Abstract 


The  SNAG  repression  domain,  named  for  its  presence  in  the  Snail/GFI-1  class  of  zinc 
finger  transcription  factors,  is  present  in  a  variety  of  proto-oncogenic  transcription  factors 
and  developmental  regulators.  The  prototype  SNAG  domain  containing  oncogene,  GFI-1 
(growth  factor  independence-1)  is  responsible  for  development  of  T-cell  thymomas. 
Significantly,  these  oncogenic  functions  require  a  functional  SNAG  repression  domain. 
However,  the  molecular  mechanism  of  SNAG  domain  mediated  transcriptional 
repression  that  is  responsible  for  these  biological  functions  is  completely  unknown. 
Using  a  yeast  two-hybrid  strategy,  we  identified  Ajuba  and  LIMD1  proteins,  which  can 
function  as  novel  SNAG  corepressors.  Ajuba  shuttles  between  the  cytoplasm  and  the 
nucleus  and  may  form  a  novel  intracellular  signaling  system.  Ajuba  interacts  with  SNAG 
in  vivo ,  co-localizes  with  it,  and  enhances  SNAG-mediated  transcriptional  repression.  We 
developed  a  strategy  to  recruit  the  mediators  of  SNAG  repression  to  an  expressed 
transgene  in  mammalian  cells  by  using  a  synthetic,  hormone-regulated  SNAG  repression 
domain.  We  show  that  in  presence  of  hormone,  the  transgene  is  rapidly  repressed  due  to 
the  assembly  of  a  multiprotein  complex  that  comprises  of  Ajuba,  LIMD1,  possibly 
histone  deacetylases  and  hitherto  unidentified  proteins  at  the  promoter.  We  suggest  that, 
in  this  molecular  interaction,  Ajuba  may  function  as  an  adapter  or  molecular  scaffold  for 
the  assembly  of  a  macromolecular  repression  complex  at  the  target  promoters  while  the 


SNAG  domain  may  serve  as  a  nuclear  targeting  motif  in  the  transport  of  Ajuba  into  the 
nucleus. 

Introduction 

Generally,  the  transcription  factors  are  modular  in  architecture;  possess  sequence- specific 
DNA  binding-,  and  functionally  separable  effector-domains.  The  Cys2-His2  (C2H2)  type 
zinc  finger  proteins  (ZFPs)  constitute  a  family  of  transcription  factors,  which  can  be 
classified  into  distinct  sub-families  such  as  SNAG-,  KRAB-,  and  BTB/POZ-ZFP  based 
on  the  highly  conserved  functional  domains  they  possess  at  their  amino  termini. 
Mechanisms  followed  during  transcriptional  regulation  include:  1)  direct  interaction  with 
the  components  of  basal  transcription  machinery,  2)  ATP-dependent  remodeling  of  the 
chromatin,  and  3)  site-specific  modifications  such  as  acetylation,  phosphorylation, 
methylation  and  ubiquitination  of  the  nucleosomal  histones,  which  serve  as  biochemical 
codes  in  the  context  of  complex  chromatin  structure.  A  widely  accepted  paradigm  is  that 
histone  acetylation  leads  to  gene  activation  while  histone  deacetylation  results  in 
repression.  Often,  the  domains  mediating  the  repression  are  proline-,  alanine-,  or 
glycine-rich  (1).  Importantly,  many  repressors  function  through  corepressor  molecules  as 
exemplified  by:  DR-DRAP- 1  (2),  WRPW-Groucho  (3),  REST-Co-REST  (4)  and  KRAB- 
KAP-1  (5). 

SNAG  repression  domain  (Snail/GFI- 1 )  is  present  in  the  vertebrate  homologues 
of  the  Snail/Slug  and  in  the  GFI-1  (growth  factor  independence-I)  ZFPs  (6-9).  The  GFI- 
1  proto-oncogene  was  cloned  by  an  insertional  mutagenesis  strategy  wherein  the 
Moloney  murine  leukemia  virus-infected  T-cells  selected  for  IL-2-independence  showed 


non-random  integration  of  the  provirus  at  the  GFI-1  locus  (10).  Remarkably,  GFI-1  was 
independently  cloned  as  a  gene,  which  could  cooperate  with  myc  and  piml  in  transgenic 
models  of  B-  and  T-cell  lymphoma  (11).  The  GFI-1  is  a  nuclear  protein,  recognizes  a  12 
bp  consensus  sequence  (12).  Potential  target  genes  have  been  identified  based  on  the 
presence  of  this  recognition  sequence  in  their  promoter/enhancer  regions  (13).  Full- 
length  GFI-1  represses  reporter  plasmids  containing  these  binding  sites  in  multiple  cell 
lines.  Furthermore,  the  20-amino  acid  SNAG  domain  alone  fused  to  the  zinc  finger 
region  was  sufficient  for  repression  and  single  amino-acid  substitutions  in  the  SNAG 
domain  abolished  this  repression  function  (9).  Consistent  with  its  role  as  a  T-cell  tropic 
oncogene,  overexpression  of  GFI-1  in  immortalized  T-cells  allowed  them  to  escape  the 
G1  arrest  induced  by  IL-2  withdrawal  and,  interestingly,  a  SNAG-domain  mutant  of  GFI- 
1  was  inactive  in  this  capacity  (9).  Gene  knockout  studies  have  demonstrated  that 
mammalian  GFF1  genes  are  essential  for  the  development  of  erythroid  and 
megakaryocytic  lineages  (14,  15).  A  recent  study  has  shown  that  GFI-1  is  absolutely 
essential  for  neutrophil  differentiation  (16). 

Several  vertebrate  homologues  of  the  fly  Snail  and  Slug  genes  have  been  cloned 
and  their  roles  in  development  studied  (17).  In  Xenopus,  the  Snail/Slug  family  has  been 
shown  to  play  essential  roles  in  both  mesoderm  differentiation  and  in  neural  crest 
induction/migration  and  these  functions  require  a  competent  SNAG  domain.  Both  FGF 
and  HGF  induce  the  Slug  gene’s  expression,  which  in  turn  induces  epithelial- 
mesenchymal  transitions  (EMT)  (18).  The  E2A-HLF  fusion  protein  has  been  shown  to 
activate  the  human  Slug  gene  with  resultant  IL-3-independent  survival  and  transformation 
of  pro-B  cells  (19).  Snail  family  members  also  play  crucial  roles  in  the  development  of 


mesoderm  and  nervous  system  by  triggering  EMT.  Recent  studies  have  shown  that  the 
human  and  mouse  E-cadherin  promoters  are  direct  targets  for  Snail  ZFPs  and  the  loss  of 
E-cadherin  expression  is  a  major  contributor  to  acquisition  of  an  invasive,  highly 
malignant  phenotype  during  human  tumor  progression  (20,  21).  Thus,  it  is  clear  that  the 
SNAG-ZFPs  play  important  and  diverse  roles  in  embryonic  development  and  in  human 
disease.  Despite  these  advances  in  defining  the  roles  of  SNAG  domain  proteins  in 
biological  processes,  the  molecular  mechanisms  of  SNAG-mediated  transcriptional 
repression  are  still  completely  unknown. 

Using  a  yeast  two-hybrid  screen,  we  have  identified  Ajuba  (22,  23)  and  LIMD1 
(24)  proteins  as  SNAG  domain  interactors.  We  have  demonstrated  that  Ajuba  functions 
as  a  SNAG  domain-specific  corepressor  by  performing  thorough  biochemical 
characterizations.  We  suggest  that  these  LIM  proteins  mediate  the  repression  function  of 
SNAG-ZFPs  by  assembling  a  macromolecular  repression  complex  at  the  target 
promoters. 

Materials  and  Methods: 

Cell  lines  and  Antibodies:  The  COS-1,  NIH3T3  and  its  stable  cell  line  derivatives 
(SPHL11  and  SPHL20)  were  grown  as  described  (25).  The  a-PAX3  (26)  and  oc-LEXA 
IgGs  (Santa  Cruz  Biotechnology)  are  rabbit  and  goat  polyclonal  antibodies  that  recognize 
the  PAX3  and  LEXA  DNA-binding  domains  (DBD),  respectively.  The  a-Ajuba,  a- 
LIMD1  are  polyclonal  antibodies  made  by  immunizing  rabbits  with  6-His  fusion  proteins 
of  mouse  Ajuba  (amino  acids  1-216)  and  mouse  LIMD1  (amino  acids  1-158)  as  antigens. 
These  antibodies  do  not  cross-react  with  each  other.  The  a-MYC  tag  (clone  9E10),  a- 


AcH3,  a-AcH4  (raised  against  acetylated  histone  H3  &  H4)  and  a-H3-MeK9  (raised 
against  lys9-methylated  histone  H3)  antibodies  were  purchased  (Upstate). 

Construction  of  a  SNAG  domain  minigene  and  its  derivatives:  A  synthetic  SNAG 
domain  coding  sequence  derived  from  GFI-1  was  constructed  by  overlap-extension-PCR. 
A  Kozak  consensus  was  added  to  the  initiator  methionine  of  the  20-amino  acid  SNAG 
domain  (MPRSFLVKSKKAHSYHQPRS)  plus  a  six  amino  acid  spacer  (PGPDYS)  and 
in-frame  EcoR  I  and  Sal  I  sites  were  included  for  cloning  into  pSP73  vector.  Next,  we 
performed  alanine-scanning  mutagenesis  for  the  analysis  of  the  SNAG  repression 
domain.  Using  appropriate  mutant  oligos,  we  first  changed  the  FLV  to  AAA  and  KK  to 
AA;  later  mutations  involved  single  amino  acid  substitutions  S-A  and  F-A  as  illustrated 
in  Figures  1C  and  2A.  These  wild  type  (wt)  and  mutant  SNAG  minigenes  were  used  to 
construct  PAX3-,  FEXA-,  and  pBTMl  16-FEXA  fusions. 

Immunoprecipitation  (IP)  and  Electrophoretic  Mobility  Shift  Assays  (EMSA):  Each 
expression  plasmid  was  verified  for  stable  protein  expression  in  vivo  as  described  (25). 
Briefly,  [l5S]-L-methionine  labeled  whole  cell  extracts  prepared  from  transiently 
transfected  COS-1  cells  were  immunoprecipitated  with  a-PAX3  IgG  or  a-LEXA  IgG, 
and  analyzed  by  SDS-PAGE  and  fluorography.  In  co-immunoprecipitation  experiments, 
subsequent  to  co-transfection  of  either  SNAG-PAX3  or  SNAG-LEXA  with  MYC-Ajuba 
plasmid  (1:1  ratio),  immunoprecipitation  was  carried  out  in  mild  conditions  using  ELB 
buffer  (27).  Nuclear  extracts  were  prepared  using  a  rapid  protocol  (26)  and  used  in 
EMSA  with  32P-labeled  e5  (PAX3  site  derived  from  the  engrailed  gene)  probe. 
Transcriptional  repression  assays:  To  monitor  the  repression  potentials  of  the  chimeric 
SNAG  repressor  proteins,  2  x  105  NIH3T3  cells  were  transfected  with  1  ug  of  the 


expression  plasmid  along  with  0.5  pg  of  PAX3-  or  LEXA-luciferase  reporter  plasmids, 
and  0.25  pg  of  pCMV-LacZ  plasmids,  using  lipofectAMINE  (Life  Technologies). 
Whole  cell  extracts  were  assayed  for  luciferase  activities  and  then  normalized  to  |3- 
galactosidase  values  for  transfection  efficiency  (25,  26).  Fold  repression  was  determined 
as  the  ratio  of  normalized  light  units  in  vector  versus  SNAG-expression  plasmid 
transfected  cells.  To  examine  the  effect  of  Ajuba  on  SNAG-PAX3  or  SNAG-LEXA- 
mediated  repression,  co-transfection  was  performed  as  above. 

Immunofluorescence  and  co-localization:  Immunocytochemistry  was  performed 
essentially  as  described  (27).  NIH3T3  cells  were  transfected  with  the  indicated 
expression  plasmids,  fixed,  permeabilized,  and  then  incubated  with  a-LEXA  (1:100 
dilution)  or  a-MYC  (1:1000  dilution)  antibodies.  The  immunostained  cells  were 
detected  with  FITC-conjugated  oc -rabbit  IgG  (1:500  dilution)  or  Texas  Red-conjugated  a- 
mouse  IgG  (1:1000  dilution),  respectively.  Finally,  the  cells  were  stained  for  DNA  with 
Hoechst  (2  ng/ml)  and  mounted  on  glass  slides  using  Fluoromount  G  (Southern 
Biotechnology  Associates,  Inc).  Cells  were  visualized  using  a  Leica  confocal  laser¬ 
scanning  microscope.  Two  channels  were  recorded  simultaneously  if  no  cross  talk  could 
be  detected.  Images  were  captured  using  QED  Imaging  software. 

Chromatin  Immunoprecipitation  (ChIP):  SPHL11  or  SPHL20  cells  were  plated  at  5  x 
106  cells/  150-mm  dish,  treated  continuously  with  either  500  nM  4-OHT  (+OHT  dishes) 
or  0.1%  ethanol  (-OHT  dishes)  for  2  days,  and  then  fixed  in  1%  formaldehyde  (EM 
Biosciences)  for  20  min  at  37°C.  Solubilized,  sonicated  chromatin  was  prepared  as 
described  (28),  and  immunoprecipitated  with  a-PAX3,  a-Ajuba,  a-LIMDl,  a-AcH3,  a- 
AcH4  or  a-H3-MeK9  antibodies.  Usually,  10%  of  the  clarified  chromatin  was  saved  as 


input.  The  immune  complexes  were  processed  as  described  (28).  Both  the  input  and  the 
immunoprecipitated  DNAs  were  used  in  quantitative  PCR  reactions  with  the  primer-pairs 
illustrated  in  Figure  5A.  The  PBS1  (5’  G  ATCG  AT  A  ATT  CG  AGCT  ACT  G  3’)  and 
PBS2  (5’  G AGCT CGGT ACCCGGGT CG  3’)  primer-pair  amplify  the  PAX3  binding 
sites;  the  primer-pair  TKP1  (5’  GCGCGGTCCCAGGTCCACTT  3’)  and  LUC1  (5’ 
T  CC  AGG  A  ACC  AGGGCGT  AT  CTCT  3’)  amplify  the  HSV-TK  promoter  region.  The 
DNA  fragments  were  electrophoresed  on  1.5%  agarose  gels  and  either  photographed  or 
Southern-blotted  and  autoradiographed. 

Results 

Characterization  of  engineered  SNAG  repressors:  SNAG  transcription  factor 
superfamily  includes  Snail,  Slug,  GFI-1,  Scratch,  and  IA-1  ZFPs.  In  spite  of  possessing  a 
varied  number  of  C2H2-type  zinc  fingers  at  their  COOH-termini,  they  all  contain  a  highly 
conserved  NH2-terminal  SNAG  repression  domain  (Figure  1A).  Multiple  sequence 
alignment  using  Clustal  W  indicates  that  the  SNAG  domain  homology  extends  to  ~20 
amino  acids  with  the  NH2-terminal  first  seven  amino  acids  being  strictly  conserved 
between  the  different  members  (Figure  IB).  Further,  the  SNAG  domains  of  GFI-1  and 
GF1-1B  genes  also  possess  a  well-conserved  nuclear  localization  signal  (KSKK).  To 
determine  if  the  synthetic  SNAG  domain  would  function  as  a  transferable,  modular 
repression  domain,  we  fused  it  to  the  NH2  terminus  of  the  minimal  PAX3  DNA  binding 
domain  (PAX3)  (Figure  1C).  We  have  used  the  PAX3  as  a  recipient  for  the  following 
reasons,  the  PAX3:  1)  binds  DNA  as  a  monomer  and  recognizes  an  extended  non¬ 
degenerate  DNA  binding  site  due  to  paired-  and  homeodomain-DNA  binding  motifs,  2) 


is  easily  detectable  using  PAX3  antibodies,  and  3)  is  essentially  neutral  when  bound  to 
DNA  in  the  absence  of  an  effector  domain.  We  used  the  alanine-scanning  mutagenesis  to 
map  the  amino  acid  sequence  requirement  for  active  repression.  We  targeted  both  the 
extreme  NH2  terminal  block  of  seven  amino  acids,  which  is  identical  among  all  SNAG 
domains  and  the  more  COOH-terminal  region,  which  is  not  as  highly  conserved.  After 
PCR-mediated  site-directed  mutagenesis  and  DNA  sequence  analysis,  they  were  sub¬ 
cloned  into  the  PAX3  constructs  (Figure  1C).  Each  protein  was  comparably  expressed 
(Figure  ID)  and  showed  DNA  binding  activity  to  e5  site  to  the  same  extent  (Figure  IE). 
Expression  plasmids  encoding  SNAG-PAX3  fusions  were  co-transfected  with  the  PAX3- 
luciferase  reporter  plasmid  and  cell  extracts  assayed  for  activity.  As  depicted  in  Figure 
IF,  fusion  of  the  SNAG  domain  to  the  NH2-terminus  of  PAX3  created  a  potent  repressor. 
Repression  was  dose-dependent  and  was  strictly  contingent  upon  PAX3  binding  sites  in 
the  reporter  plasmid  (data  not  shown).  However,  the  mutants  varied  in  their  abilities  to 
repress  the  PAX3-lucif erase  in  reporter  assays  (Figure  IF).  The  FLV-AAA  substitution 
completely  abolished  the  repression.  However,  the  S-A  and  KK-AA  mutants  retained 
significant  repression  activity  whereas  the  single  substitution  F-A  abolished  repression 
function.  Thus,  phenylalanine  to  alanine  substitution  in  the  highly  conserved  SNAG 
domain  converts  a  normally  powerful  repressor  of  transcription  into  a  neutral  DNA 
binding  protein.  Whether  the  seven-amino  acid  NH2  terminal  segment  is  sufficient  for 
repression  remains  to  be  tested.  Interestingly,  a  modular  repression  domain  as  small  as  4 
amino  acids  (WRPW)  present  in  the  hairy  and  achaete-scute  bHLH  transcription  factors 
is  sufficient  to  confer  repression  by  recruiting  the  Groucho  corepressor  (3).  We  suggest 
that  the  SNAG  domain  may  function  in  a  similar  manner. 


A  Yeast  Two-Hybrid  Strategy  to  identify  SNAG  domain- Associated  Proteins 

(SNAPS):  We  reasoned  that,  like  other  repression  domains,  direct  protein-protein 
interactions  might  be  required  for  SNAG  mediated  repression.  One  of  the  strategies  for 
isolating  a  SNAG-specific  corepressor  is  to  perform  a  yeast-two  hybrid  screen  using  the 
20-amino  acid  SNAG  domain  as  bait  and  to  counter-screen  the  preys  with  the  SNAG 
domain  mutants  that  lack  repression  function.  We  generated  a  SNAG-LEXA  fusion  by 
placing  the  SNAG  domain  at  the  NH2-terminus  of  the  LEXA  DNA  binding  domain 
(Figure  2A).  In  order  to  limit  the  spectrum  of  potential  protein-protein  interactions 
critical  for  SNAG  function,  we  only  used  the  20-amino  acid  SNAG  domain  and  not  the 
full  NH2  terminus  of  GFI-1.  However,  in  order  to  ensure  that  there  were  no  spatial 
constraints  to  protein  binding  we  included  a  small  spacer  of  17  amino  acids  of  PAX3 
between  the  SNAG  domain  and  the  beginning  of  the  LEXA.  We  also  constructed  mutant 
SNAG-LEXA  fusions  with  two  mutants  that  abolished  repression,  and  two  mutants, 
which  showed  wild-type  levels  of  repression  activity  (Figure  2A).  The  SNAG-LEXA 
fusions  were  properly  expressed  in  mammalian  cells  (Figure  2A),  exhibited  proper 
binding  to  a  LEXA  DNA-binding  site  in  gel  shift  assays  (data  not  shown),  and  repressed 
the  LEXA-luciferase  reporter  plasmids  (Figure  2A).  Thus,  this  spectrum  of  “baits” 
formed  the  important  set  of  reagents  with  which  to  identify  and  distinguish  true  mediators 
of  SNAG  repression. 

First,  we  showed  that  the  pBTMl  16-SNAG-LEXA  constructs  neither  activated 
nor  repressed  transcription  in  yeast  (data  not  shown).  The  wt  SNAG-LEXA  fusion  was 
then  co-transformed  with  a  murine  embryonic  (E9-E1 1)  library  constructed  in  pVP16  and 
scored  for  histidine  prototrophy  and  LacZ  activity  in  a  standard  two-hybrid  screen.  From 


~  200  primary  hits  that  were  obtained  upon  screening  ~40  million  library  clones,  we 
isolated  ~20  that  interacted  strongly  with  the  wt  SNAG  domain.  Several  of  these  were 
re-screened  by  reintroduction  into  naive  yeast  with  a  variety  of  other  baits.  As  shown  in 
Figure  2B,  SNAP13  and  SNAP20  interacted  very  strongly  with  wt  SNAG  but  failed  to 
interact  with  the  SNAG  (FLV-AAA)  mutant,  LEXA,  or  any  of  the  other  negative  control 
baits  including  KAP-1,  Lamin,  Rho  and  PKC.  We  further  characterized  SNAP  13  and 
SNAP20  by  reintroducing  them  into  naive  yeast  along  with  each  of  the  pBTM116- 
SNAG-LEXA  baits  (mating  assay).  As  shown  in  Figure  2C,  their  interactions  were 
almost  completely  concordant  with  repression  activity;  each  interacted  with  active  SNAG 
repressors  (wt  and  S-A)  but  did  not  interact  with  the  mutants  (FLV-AAA  and  F-A)  which 
abolish  repression.  The  exception  to  this  concordance  is  KK-AA,  which  is  expressed  at  a 
reduced  level  in  yeast  (data  not  shown).  Thus,  by  these  criteria  these  clones  are  strong 
candidates  for  mediators  of  SNAG  function. 

We  rescued  the  inserts,  and  the  deduced  DNA  sequence  was  in-frame  with  the 
VP  16  activation  domain  encoded  by  the  vector.  In  the  primary  screen,  we  obtained  two 
independent  hits  of  SNAP  13  and  multiple  but  identical  isolates  of  SNAP20.  BLAST 
searches  revealed  that  SNAP  13  and  SNAP20  were  identical  to  the  two  NH2-terminal  LIM 
domains  of  Ajuba  and  LIMD1,  respectively.  LIM  is  an  acronym  of  three  transcription 
factors,  Linll,  Isl-1,  and  Mec-3,  in  which  the  motif  was  first  identified  (29).  The  LIM 
domain  protein  sub-family  that  includes  Ajuba  and  LIMD1  is  illustrated  in  Figure  2D. 
The  Ajuba,  LIMD1,  and  other  three-LIM  domain  proteins  such  as  Zyxin,  TRIP6,  LPP 
contain  characteristic  glycine/proline-rich  regions  and  nuclear  export  signals  (NES). 
They  belong  to  Group  3  LIM  domain  family  whose  members  exhibit  nucleocytoplasmic 


distribution  and  mediate  both  cellular  and  nuclear  signaling  events.  LIM  domains  are 
bona  fide  protein-protein  interaction  motifs,  which  encode  a  signature  Cys2-Hisl-Cys3- 
Cys  Zn2+  binding  domain  as  shown  for  the  three  individual  LIM  domains  of  Ajuba 
(Figure  2E).  A  dendrogram  analysis  constructed  from  the  LIM  domains  of  the  three 
LIM  domain  proteins  revealed  that  a  distinct  phylogenetic  relationship  exists  only 
between  the  LIM  domains  of  Ajuba  and  LIMD1  (Figure  2F).  The  SNAG-interacting 
LIM  domains  of  Ajuba  and  LIMD1  (about  134  amino  acids)  fell  into  a  single  group  with 
a  similarity  index  of  62.7,  implying  that  they  evolved  from  a  common  ancestral  gene. 
Ajuba  interacts  with  SNAG  in  vivo  and  functions  as  a  corepressor:  To  verify  whether 
the  SNAG-LIM  domain  interactions  occur  in  vivo  in  mammalian  cells,  we  used 
expression  vectors  for  SNAG-PAX3,  SNAG-LEXA,  and  MYC-epitope  tagged  full  length 
Ajuba  (MYC-Ajuba).  Proper  expression  of  the  MYC- Ajuba  protein  was  confirmed  by 
immunoprecipitation  with  a-MYC  MAb  (data  not  shown).  Next,  we  co-transfected  the 
MYC-Ajuba  and  wt  or  mutant  SNAG-LEXA  plasmids  into  COS-1  cells.  The  35S- 
methionine-labeled  cell  lysates  were  immunoprecipitated  with  either  a-MYC  or  a-LEXA 
IgG.  Figure  3A  shows  that  the  a-MYC  detects  a  66-kDa  MYC-Ajuba  protein  while  the 
a-LEXA  detects  a  25-kDa  LEXA  protein  in  the  MYC-Ajuba+LEXA  vector  co¬ 
transfected  cells.  However,  in  the  wt  SNAG-LEXA+MYC-Ajuba  co-transfected  cells, 
the  a-LEXA  also  detects  the  MYC-Ajuba  protein.  As  expected,  the  a-MYC  also  detects 
the  wt  SNAG-LEXA  protein  in  the  same  cell  population.  Strikingly,  similar  co- 
immunoprecipitation  experiments  with  mutant  SNAG-LEXA  plasmids  indicated  that  the 
repression-incompetent  mutants  of  the  SNAG  domain  fail  to  bind  the  Ajuba,  strongly 
suggesting  that  this  interaction  is  functionally  relevant  (Figure  3A). 


To  confirm  that  the  region  of  Ajuba  required  for  SNAG  interaction  contained  the 
LIM  domain,  we  used  the  expression  plasmids  coding  for  the  NH2-  or  COOH-terminal 
segments  of  Ajuba  (defined  as  “Ajuba-PreLIM”  or  “Ajuba-LIM”)  in  co-transfection 
experiments  along  with  the  SNAG-PAX3  plasmid.  We  observed  that  the  a-PAX3  sera 
detects  the  SNAG-PAX3  and  also  co-precipitates  the  Ajuba  protein  or  the  Ajuba-LIM 
domain  but  not  the  Ajuba-preLIM  domain,  proving  that  the  LIM  region  of  Ajuba  is  the 
site  of  SNAG  domain  interaction  (data  not  shown).  Thus,  Ajuba-SNAG-PAX3 
complexes  are  very  efficiently  formed  in  vivo  following  co-transfection  and  are  stable  to 
moderate  salt,  detergent  extraction  procedures.  Importantly,  this  co-precipitation  was  not 
detected  when  a  PAX3  gene  that  lacks  a  SNAG  domain  was  used,  confirming  that  a 
SNAG- Ajuba  interaction  is  occurring  (data  not  shown). 

We  performed  co-transfection  experiments  to  determine  if  Ajuba  can  modify  the 
transcriptional  repression  function  of  the  SNAG  domain.  We  transfected  CMV  vectors 
for  SNAG-PAX3  or  SNAG-LEXA  constructs  into  NIH3T3  cells,  with  or  without  CMV- 
Ajuba.  SNAG-LEXA  repressed  the  LEXA-luciferase  reporter,  and  this  repression  was 
enhanced  by  CMV- Ajuba.  As  expected,  Ajuba  also  enhanced  SNAG-PAX3  mediated 
repression  of  the  PAX3-luciferase  reporter  (Figure  3B).  These  promising  results  suggest 
that  Ajuba  mediates  SNAG  repression  and  functions  as  a  SNAG  corepressor. 
Recruitment  and  Nuclear  Co-localization  of  Ajuba  and  SNAG  in  NIH3T3  cells:  The 
functional  studies  prompted  us  to  investigate  the  sub-cellular  localization  patterns  of  the 
MYC-Ajuba,  the  wt  and  mutant  SNAG-LEXA  proteins.  Immunofluorescence  analysis 
revealed  that  cells  transfected  with  MYC-Ajuba  showed  predominant  cytoplasmic 
staining  and  those  that  were  transfected  with  SNAG-LEXA  (wt  and  mutants)  showed 


intense  nuclear  staining  (Figure  4A).  However,  cells  that  received  both  Ajuba  and 
SNAG-LEXA  plasmids  showed  abundant  nuclear  staining  of  Ajuba  that  was  solely 
dependent  on  the  repression  competency  (Figure  4B).  A  majority  of  cells  in  this  doubly 
transfected  population  showed  the  pattern  observed  in  Figure  4B.  These  remarkable 
images  strongly  corroborate  our  functional  studies  described  earlier.  These  results  also 
suggest  that  the  SNAG  domain  can  recruit  and/or  retain  Ajuba  in  the  nucleus,  and  are 
consistent  with  our  earlier  observations  that  Ajuba  is  both  cytoplasmic  and  nuclear. 
Recent  reports  suggest  that  Group  3  LIM  proteins  like  Zyxin  and  Ajuba  can  shuttle 
between  the  cytoplasm  and  the  nucleus  and  thus  may  form  a  novel  intracellular  signaling 
system.  It  has  been  hypothesized  that  there  are  nuclear  “anchors”  for  this  class  of 
shuttling  LIM  proteins  (30).  We  hypothesize  and  present  evidence  that  the  SNAG 
domain  may  be  such  an  anchor  in  case  of  Ajuba  and  LIMD1. 

Molecular  components  of  SNAG  repression  are  recruited  to  the  target  promoter: 

We  investigated  whether  the  SNAG  domain  mediated  transcriptional  repression  involves 
chromatin  modification  by  using  trichostatin-A  (TSA),  a  specific  inhibitor  of  histone 
deacetylase  (HD AC)  enzyme.  We  generated  stable  cell  lines,  which  contain  a  chromatin- 
integrated  PAX3-luciferase  reporter  transgene  and  a  conditional  SNAG  repressor 
(SNAG-PAX3-HBD)  (SPHBD)  (Figure  5A).  We  tested  multiple  cell  lines  for  protein 
expression  and  4-OHT  dependent  repression  of  the  PAX3-luciferase  reporter  (25)  (data 
not  shown).  We  chose  the  SPHL1 1  and  SPHL20  clones  to  study  the  effect  of  TSA  due  to 
their  repression  potentials.  In  presence  of  4-OHT,  about  six-fold  repression  was 
observed,  however,  in  presence  of  both  4-OHT  and  TSA,  this  repression  was  totally 
relieved  in  both  clones  (data  not  shown).  This  experiment  suggests  that  one  of  the 


mechanisms  of  SNAG  domain  repression  involves  chromatin  modification.  To  confirm 
these  biochemical  evidences  of  SNAG  repression  mechanisms  at  the  molecular  level,  we 
performed  comprehensive  chromatin  immunoprecipitation  (ChIP)  experiments  using  a 
battery  of  antibodies  raised  against  putative  molecular  components.  Chromatin- 
associated  proteins  were  cross-linked  to  DNA  in  vivo  with  formaldehyde  in  mock  or  4- 
OHT  treated  SPHL11  and  SPHL20  cells  and  immunoprecipitated  with  a-PAX3,  a- 
Ajuba,  a-LIMDl,  a-AcH3,  a-AcH4,  and  a-MeK9  antibodies.  The  immunoprecipitated 
DNA  was  analyzed  by  quantitative  PCR,  using  primer  pairs  for  the  6xPAX3  binding  sites 
(PBS1&PBS2),  or  the  TK  promoter  regions  (TKP1&LUC1)  of  the  CD19-TK-LUC  ZeoR 
locus.  Fragments,  which  bracket  the  PAX3  binding  sites,  were  considerably  higher  in  the 
PAX3  immunoprecipitates  (IPs)  after  4-OHT-treatment  indicating  that  there  was  a  strong 
enrichment  of  the  SPHBD  protein  at  the  PAX3  sites.  This  experiment  was  controlled  by 
the  presence  of  similar  levels  of  PAX3  sites  in  the  input  chromatin  (Figure  5B). 
Likewise,  we  observed  a  strong,  4-OHT-dependent  enrichment  of  the  257  bp  TK 
promoter  fragment  in  a-Ajuba,  a-LIMDl,  and  a-MeK9  chromatin  IPs  (Figure  5C).  Thus, 
in  addition  to  the  DNA  binding  component,  other  components  of  the  SNAG  repression 
complex  (i.e.  Ajuba,  LIMD1  and  H3-MeK9)  were  inducibly  recruited  to  the  target  gene. 
Based  on  these  results,  we  conclude  that  the  SPHBD  fusion  protein,  and  Ajuba,  LIMD1, 
H3-MeK9  are  strongly  recruited  to  the  chromatin  regions  comprising  the  PAX3  binding 
site,  and  the  TK  promoter  region,  respectively. 

In  parallel  ChIP  experiments,  we  observed  significant  reduction  in  the  levels  of 
acetylated  histone  H3  and  H4  at  the  TK  promoter  region  upon  4-OHT-treatment, 
indicating  that  the  histone  hypoacetylation  occurs  upon  recruitment  of  SPHBD  protein  to 


the  upstream  PAX3  sites  (Figure  5D).  These  results  are  in  agreement  with  the  complete 
reversal  of  SNAG  repression  we  observed  in  presence  of  TSA.  Future  chromatin 
immunoprecipitation  experiments  will  unravel  the  identity  of  the  HDAC  involved  in 
SNAG  repression.  We  expect  these  experiments  to  yield  clues  about  the  nature  of  the 
unique  nucleocytoplasmic  signaling  mechanism  operated  by  Ajuba  and  LIMD1  proteins. 

Discussion 

Mammalian  zinc  finger  transcription  factors  play  vital  roles  during  organism 
development  and  subsequent  homeostasis.  In  order  to  gain  insights  into  the  molecular 
mechanisms  of  SNAG-mediated  repression  that  are  responsible  for  the  biological 
functions  of  SNAG-ZFPs,  we  isolated  and  characterized  SNAG-specific  corepressors. 
The  modular  nature  of  SNAG  and  other  repression  domains  such  as  KRAB,  BTB/POZ 
aids  to  isolate  them  away  from  the  other  functional  elements  present  in  the  full-length 
proteins  and  study  them  in  detail.  We  followed  this  approach  in  our  earlier  studies  to 
understand  the  mechanistic  details  of  the  KRAB  mediated  repression  by  isolating  and 
characterizing  the  KAP-1  (KRAB  Associated  Protein-1)  corepressor  (5,  27).  In  this 
study,  we  have  identified  Ajuba  and  LIMD1  proteins  as  downstream  players  in  the 
SNAG  repression  pathway. 

Subsequent  to  the  initial  identification  of  LIM  domain  proteins  Ajuba  and  LIMD1 
as  SNAG-domain  interactors  in  the  yeast  two-hybrid  assay,  we  performed  extensive 
biochemical  analyses  to  confirm  this  interaction.  Our  striking  results  from  co- 
immunoprecipitation  experiments  indicate  that  SNAG  domain  interacts  avidly  with 
Ajuba  in  vivo  and  also  that  Ajuba  can  enhance  SNAG-mediated  repression.  Interestingly, 


we  observed  these  results  only  with  repression-competent  SNAG  derivatives.  The  KAP1 
corepressor,  which  interacts  only  with  repression-competent  KRAB  domains  also 
enhance  KRAB-mediated  repression.  Since,  Ajuba  possesses  these  important 
characteristics  of  corepressor  proteins  similar  to  KAP1,  we  designate  Ajuba  as  a  SNAG- 
corepressor. 

LIM  domain  interactions  have  been  observed  in  several  classes  of  transcription 
factors  and  cofactors.  Specific  examples  include:  1)  The  nuclear  LIM-only  (LMO) 
protein  interact  with  GATA-1,  2  and  Tall/Scl  transcription  factors  (31);  2)  The  LIM- 
domain  binding  protein  Ldbl,  its  LIM-only  protein  partner  LM02,  and  the  DNA-binding 
SCL/E12  exist  in  a  multiprotein  complex  that  negatively  regulates  erythroid  function 
(32);  3)  FHL2/DRAL  interacts  with  the  promyelocytic  leukemia  zinc-finger  (PLZF) 
protein  (a  sequence- specific  transcriptional  repressor)  and  functions  as  a  corepressor  by 
augmenting  PFZF-mediated  transcriptional  repression  (33);  4)  FHF2  associates  with  Jun 
and  Fos  and  serves  as  a  co-activator  of  AP-1  by  stimulating  Fos-  and  Jun-dependent 
transcription  (34);  5)  The  CFIM  forms  a  protein  complex  consisting  of  FIM-HD,  FMO, 
bHFH,  GATA,  at  the  target  promoters  (35);  6)  The  FIM-homeodomain  transcription 
factor  Fhx3  binds  to  the  pituitary-specific  transcription  factor  Pitx-1  and  also  interacts 
with  several  co-activator/adapter  proteins  including  the  SFB,  selective  FIM-binding 
protein  (36);  and  7)  Cysteine-rich  FIM-only  proteins  (CRP1  and  CRP2)  play  important 
roles  in  organizing  multiprotein  complexes,  both  in  cytoplasm,  where  they  participate  in 
cytoskeletal  remodeling,  and  in  the  nucleus,  where  they  strongly  facilitate  smooth  muscle 
differentiation  (37).  These  studies  clearly  demonstrate  that  nuclear  FIM  proteins  can 


function  as  adapter  molecules  in  the  formation  of  large  multiprotein  complexes  that  form 
on  DNA  and  that  influence  transcription. 

Our  co-localization  studies  demonstrate  that  Ajuba  by  itself  remains  totally 
cytoplasmic,  however,  in  the  presence  of  repression  competent  SNAG  domains,  its 
localization  completely  changed  to  nuclear.  We  suggest  that  SNAG  domain  facilitates 
both  import  of  Ajuba  into  the  nucleus  and  its  subsequent  retention  there  by  serving  as  a 
nuclear  anchor.  Similar  nuclear  targeting  functions  have  been  observed  for  other  LIM 
proteins.  For  example:  1)  Stimulation  of  the  Rho  signaling  pathway  induces 
translocation  of  the  transcriptional  LIM-only  coactivator  FHL2  to  the  nucleus  (38);  2)  In 
heart,  FHL2  interacts  with  hNP220,  a  DNA-binding  nuclear  protein  to  serve  as  a 
molecular  adapter  in  the  formation  of  a  multiprotein  complex  (39);  3)  The  LIM  protein 
KyoT2,  an  alternatively  spliced  murine  isoform  of  SLIM1  has  been  shown  to  negatively 
regulate  transcription  by  interaction  with  RBP-J  DNA  binding  protein  and  displacing  it 
from  the  DNA  (40,  41);  4)  Interaction  of  Zyxin  (usually  found  in  focal  adhesions)  with 
the  human  papillomavirus-derived  E6  protein  leads  to  its  accumulation  in  the  nucleus 
where  it  functions  as  a  transcriptional  activator.  Significantly,  this  interaction  requires 
the  three  LIM  domains  present  at  the  carboxyl  terminus  of  Zyxin  (42);  and  5)  The 
Lipoma-Preferred  Partner  (LPP)  protein  exhibits  nucleocytoplasmic  distribution  and 
possesses  focal  adhesion  and  nuclear  targeting  capabilities.  It  does  not  contain  any 
consensus  nuclear  localization  signals  suggesting  that  it  may  be  imported  into  the  nucleus 
via  a  nuclear  localization  signal  containing  transport  protein  (43). 

The  co-localization  studies  also  suggested  that  upon  SNAG-mediated 
translocation,  Ajuba  along  with  its  (unidentified)  associated  proteins  could  constitute  a 


macromolecular  repression  complex  at  the  target  promoter.  Based  on  our  findings  from 
the  chromatin  immunoprecipitation  experiments,  we  favor  the  notion  that  the  initial 
DNA-binding  by  the  SNAG-PAX3  repressor  at  the  PAX3  binding  sites  leads  to  the 
subsequent  recruitment  of  Ajuba,  and  their  associated  proteins  to  the  promoter  region. 
Since  we  also  observed  significant  hypoacetylation  of  histone  tails  at  the  promoter  region 
and  complete  reversal  of  SNAG-mediated  repression  in  presence  of  TSA,  we  strongly 
believe  that  SNAG  repression  involves  histone  deacetylases  and  that  HDAC  may  be  a 
constituent  in  the  SNAG-holo  repression  complex. 

At  present,  there  are  nine  HDAC  family  members  belonging  to  three  distinct 
families.  The  class  I  HDAC  family  members,  which  consists  of  HDAC  1,  2,  3  and  8,  are 
nuclear  while  the  class  II  members,  which  consists  of  HDAC  4,  5,  6  and  7,  are  actively 
maintained  in  the  cytoplasm  and  are  imported  to  the  nucleus  when  required.  The  class  III 
consists  of  NAD+-dependent  histone  deacetylases.  In  contrast  to  the  class  II,  and  I 
members  of  the  class  III  histone  deacetylases  are  insensitive  to  the  histone  deacetylase 
inhibitor  trichostatin-A.  Based  on  our  observations,  we  hypothesize  that  either  Ajuba 
interacts  with  a  cytosolic  HDAC  and  recruits  it  to  the  nucleus,  or  it  interacts  with  a 
HDAC  existing  in  the  nucleus  and  recruits  it  to  the  repression  complex.  In  support  of 
this,  RLIM  has  been  shown  to  interact  with  members  of  histone  deacetylase  corepressor 
complex  (44).  It  would  be  interesting  to  identify  the  nature  of  the  HDAC  involved,  and  to 
decipher  the  hitherto  unidentified  associated  proteins  that  constitutes  the  macromolecular 
repressor  complex.  This  necessitates  isolation  and  characterization  of  interacting  proteins 
that  are  present  in  both  cytoplasmic  and  nuclear  compartments. 
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Figure  Legends: 

Figure  1.  Characteristics  of  SNAG-zinc  finger  transcription  factors.  A.  Alignment  of 
human  SNAG-ZFPs.  Presence  of  a  highly  conserved  NH2-terminal  SNAG  repression 
module  and  a  variable  COOH-terminal  zinc  finger  arrays  are  highlighted.  The  Slug  and 
Scratch  ZFPs  also  contain  conserved  slug  and  scratch  domains,  respectively.  B.  SNAG 
domain  homology.  The  SNAG  repression  domain  extends  to  ~20  amino  acids,  of  which 
the  NFF-terminal  first  seven  amino  acids  are  strictly  conserved.  C.  Diagrammatic 
representation  of  the  wt  and  mutant  SNAG-PAX3  repressors.  D.  Protein  expression.  The 
expression  plasmids  mentioned  in  panel  C  were  transfected  into  COS-1  cells,  35S-labeled 
cell  extracts  were  immunoprecipitated  with  a-PAX3  IgG,  and  analyzed  by  SDS-PAGE 
and  fluorography.  E.  Gel  Shift.  Nuclear  extracts  made  from  transfected  COS-1  cells  were 
incubated  with  32P-labeled  e5  probe  and  electrophoresed  on  a  non-denaturing  PAGE.  The 
arrow  denotes  the  shifted  bands.  F.  Transcriptional  repression  competence  of  SNAG- 
PAX3  repressors.  The  wt  and  mutant  SNAG-PAX3  expression  plasmids  were 
transfected  into  NIH3T3  cells  along  with  PAX3-luciferase  reporter  and  CMV-lacZ 


plasmids.  Normalized  luciferase  activity  obtained  for  each  construct  is  depicted. 


Figure  2.  Identification  and  characterization  of  SNAG-Associated  Proteins  (SNAPs).  A. 
Characteristics  of  SNAG-LEXA  repressors.  The  wt  &  mutant  SNAG-LEXA  repressors 
were  tested  for  stable  protein  expression,  and  also  monitored  for  repression  of  a  LEXA- 
luciferase  reporter.  B&C.  Specificity  of  SNAPs.  B.  The  SNAP13  and  SNAP20  clones 
bind  to  wt  SNAG,  but  not  to  SNAG  (FLV-AAA),  or  unrelated  baits  in  the  mating  assay. 
C.  These  SNAPs  bind  to  SNAG  domains  that  retain  repression  activity,  but  not  to  either 
LEXA-DBD,  or  to  mutants  that  abolish  repression.  Note  that  the  KK-AA  mutant  is  not 
stably  expressed  either  in  COS-1  (Panel  A)  or  in  yeast  (data  not  shown),  which  accounts 
for  its  poor  association.  D.  Group  3  LIM  domain  proteins.  The  SNAG-interacting  regions 
of  SNAP  13  and  SNAP20  contain  two  NH2-terminal  LIM  domains  of  LIMD1  and  Ajuba, 
respectively.  The  characteristic  gly-,  pro-rich  regions  and  nuclear  export  signals  (NES) 
present  in  these  family  members  are  highlighted.  E.  The  signature  Cys-His  spacing 
(CX2CX16_23HX2CX2CX2CX16_2iCX2_3C/H/D  where  C  is  cysteine,  H  is  histidine,  D  is 
aspartic  acid,  and  X  is  any  amino  acid),  and  conserved  residues  (highlighted  in  black 
bars)  characteristic  to  LIM  domains  are  shown  as  exemplified  for  by  Ajuba  LIM 
domains.  F.  Phylogenetic  tree  analysis.  Sequence  conservation  between  the  LIM  domains 
of  the  indicated  three  LIM  domain  proteins  was  determined  using  DNASTAR  software. 
Figure  3.  Ajuba  functions  as  a  SNAG  corepressor.  A.  Ajuba  interacts  with  repression 
competent  SNAG  domains  in  vivo.  The  wt  &  mutant  SNAG-LEXA  plasmids  were  either 
transfected  alone  or  co-transfected  along  with  Ajuba  into  COS-1  cells.  The  V\S- labeled 
cell  extracts  were  immunoprecipitated  with  a-LEXA  or  a-MYC  antibodies  and  analyzed. 
Arrow  indicates  the  presence  of  co-precipitated  Ajuba  protein.  B.  Enhancement  of  SNAG 
mediated  transcriptional  repression  by  Ajuba.  The  wt  &  mutant  SNAG-PAX3  or  SNAG- 


LEXA  plasmids  were  either  transfected  alone  or  co-transfected  with  MYC-Ajuba. 
Relative  luciferase  activities  obtained  after  normalization  are  depicted. 

Figure  4.  SNAG  domain  co-localizes  with  Ajuba  in  a  repression-competence  dependent 
manner.  A.  Localization  patterns  of  Ajuba  and  SNAG-LEXA  proteins.  The  MYC- 
Ajuba,  wt,  and  mutant  SNAG-LEXA  plasmids  were  transfected  individually  into 
NIH3T3  cells  and  immunostaining  was  carried  out  with  a-LEXA  or  a-MYC  antibodies. 
Note  that  the  Ajuba  remains  mainly  cytoplasmic  while  the  wt  and  mutant  SNAG-LEXA 
proteins  are  predominantly  nuclear.  B.  Co-localization  of  repression  competent  SNAG- 
LEXA  proteins  with  MYC-Ajuba  protein.  The  wt  &  mutant  SNAG-LEXA  plasmids  were 
co-transfected  along  with  the  MYC-Ajuba  plasmid  into  NIH3T3  cells.  “Binding,” 
indicates  interaction  in  yeast  two-hybrid  analysis.  “Repression”  refers  to  the 
transcriptional  competence  in  reporter  assays.  A  representative  cell  is  shown  for  each 
transfected  population. 

Figure  5.  SNAG-mediated  transcriptional  repression  mechanisms  on  a  chromatinized 
locus.  A.  Development  of  a  dual  plasmid  system.  The  SNAG-PAX3-HBD  chimeric 
repressor  protein  is  constitutively  expressed  from  the  CMV  promoter.  The  reporter 
plasmid  contains  six  repeats  of  PAX3  DNA  binding  motifs  and  a  HSV  thymidine  kinase 
(TK)  promoter  that  controls  the  basal  expression  of  the  luciferase  gene.  This  plasmid  also 
contains  a  zeocinR  cassette  for  drug  selection.  The  primer-pair  PBS1&PBS2  amplify  the 
PAX3  binding  sites  (344  bp)  while  the  TKP1  &LUC  1  primer-pair  amplify  the  TK 
promoter  region  (257  bp).  B,C&D.  Chromatin  IPs  identifies  putative  components  of  the 
SNAG  repression  pathway.  B.  4-OHT  dependent  enrichment  of  the  SPHBD  at  the  PAX3 
sites.  Chromatin  prepared  from  -  or  +  4-OHT  treated  SPHL11  and  SPHL20  cells  were 


immunoprecipitated  with  a-PAX3  (10  iig)  and  analyzed  by  quantitative  PCR  using 
PBS1&PBS2  primer-pair.  Arrow  indicates  the  PAX3-site  fragment  (344  bp)  amplified.  C. 
Ajuba  and  LIMD1  proteins  were  inducibly  recruited  to  the  TK  promoter.  ChIP 
experiments  were  done  with  -  or  +  4-OHT  treated  SPHL11  cells  using  a-Ajuba  (20  id) 
and  a-LIMDl  (20  id),  a-MeK9  (5  id)  antisera.  The  immunoprecipitated  DNA  was 
amplified  using  TKP1&LUC1  primer-pair.  Arrow  denotes  the  amplified  TK  promoter 
fragment  (257  bp).  D.  4-OHT  dependent  promoter  hypoacetylation.  ChIP  experiments 
were  done  with  -  or  +  4-OHT  treated  SPHL11  &  SPHL  20  cell  lines  using  a-AcH3  IgG 
(5  ql)  or  a-AcH4  (5  id)  IgG  and  the  retained  DNA  was  tested  by  quantitative  PCR  using 
TKP1&LUC1  primer-pair.  Arrow  indicates  the  amplified  TK  promoter  fragment  (257  bp). 
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Abstract 


The  SNAIL  transcription  factor  contains  C-terminal  tandem  zinc  finger  motifs 
and  an  N-terminal  SNAG  repression  domain.  The  SNAIL  family  has  recently  emerged 
as  major  contributors  to  the  processes  of  development  and  metastasis  via  regulating 
epithelial-mesenchymal  differentiation  events  during  embryonic  development  and  tumor 
progression.  However,  the  mechanisms  by  which  SNAIL  represses  gene  expression  are 
largely  undefined.  Previously  we  demonstrated  that  AJUBA,  a  LIM  domain  protein, 
functions  as  a  co-repressor  for  SNAIL  and  may  serve  as  a  platform  for  assembly  of 
chromatin  modifying  factors.  Here,  we  describe  the  identification  of  protein  arginine 
methyltransferase  5  (PRMT5)  as  an  effector  recruited  to  SNAIL  through  direct  binding  to 
AJUBA  which  functions  to  repress  the  SNAIL  target  gene,  E-cadherin.  PRMT5  binds  to 
the  non-LIM  region  of  AJUBA,  and  is  translocated  into  the  nucleus  in  a  SNAIL  and 
AJUBA-dependent  manner.  Depletion  of  PRMT5  in  pl9  cells  stimulates  E-cadherin 
expression,  and  the  ternary  complex  can  be  found  at  the  the  proximal  promoter  region  of 
the  E-cadherin  gene,  concomitant  with  increased  arginine  methylation  of  histones  at  the 
locus.  Together,  these  data  suggest  that  PRMT5  is  an  effector  of  SNAIL-dependent  gene 
repression. 
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Introduction 


The  SNAG  family  of  zinc  finger  transcription  factors  in  vertebrates  include 
GFI-1A,  GFI-1B,  the  insulinoma-associated  protein  IA-1,  the  homeobox  protein  GSH-1, 
and  the  SNAIL/SLUG  family.  These  proteins  play  important  roles  in  the  regulation  of 
development,  stem  cell  self-renewal  and  tumor  progression  (4,  19,  44).  They  share  a 
common  set  of  functional  domains:  a  C-terminal  DNA  binding  domain  composed  of  5  -7 
regularly  spaced,  Cys2-His2  zinc  fingers  and  a  highly  conserved  N-terminal  repression 
domain  designated  SNAG.  The  SNAG  motif  was  first  identified  from  GFI-1  protein  and 
comprises  the  first  21  amino  acid  residues  in  the  N-terminus.  The  SNAG  domain  is  a 
potent  and  transferable  repression  motif  (19,  44).  However,  unlike  other  repression 
domains  which  are  associated  with  zinc  finger  proteins  such  as  the  KRAB  domain  and 
the  BTB-POZ  domain  whose  mechanisms  of  repression  are  well  established,  little  is 
known  about  the  mechanisms  of  the  SNAG  domain-mediated  repression  (8,  14). 

The  SNAIL  protein  has  emerged  as  a  potent  regulator  to  the  processes  of 
embryonic  development  and  tumor  progression  by  regulating  the  epithelial-mesenchymal 
transition  (EMT)  (4,  31).  In  mammalian  cells,  SNAIL  induces  EMT  at  least  partially 
due  to  repression  of  the  E-cadherin  gene,  thereby  altering  cell  adhesion  (5).  The  SNAIL 
protein  has  been  found  in  multiprotein  complexes  containing  histone  deacetylases, 
mSIN3A  and  LOXL2/3  (34,  35).  However,  the  biological  significance  of  these 
interactions  and  how  SNAIL  mediates  functional  protein  complex  assembly  at  specific 
promoters  in  the  context  of  chromatin  remains  undefined. 

We  previously  identified  novel  co-repressors  directly  bound  to  the  SNAG 
domains  of  GLI1A  and  SNAIL  using  yeast  two-hybrid  assays.  The  AJUBA  protein  was 
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identified  as  a  prospective  candidate  which  binds  to  the  minimal  SNAG  domain.  AJUBA 
is  a  multiple  LIM  domain-containing  protein,  and  belongs  to  the  ZYXIN  family  of  LIM 
proteins  (18).  This  family  includes  AJUBA,  LIMD1,  WTIP,  ZYXIN,  LPP,  and  TRIP6. 
The  AJUBA/ZYXIN  family  is  characterized  by  three  tandem  C-terminal  LIM  domains 
and  unique  N-terminal  regions  designated  the  preLIM  regions  (18,  23).  The  AJUBA 
protein  is  predominantly  cytoplasmic,  but  can  shuttle  between  the  nucleus  and  cytoplasm 
(24).  The  AJUBA  protein  may  function  as  a  scaffold  protein  to  assemble  multiple 
cytoplasmic  protein  complexes  involved  in  the  processes  of  cell  adhesion,  migration, 
mitosis  and  cell  differentiation  (13,  18,  20).  However,  its  role  in  the  nucleus  as  a 
regulator  of  gene  expression  is  poorly  defined. 

Detailed  characterization  of  the  interaction  between  AJUBA  and  SNAIL  by  in 
vivo  and  in  vitro  approaches  demonstrated  that  AJUBA  functions  as  a  SNAIL  co¬ 
repressor  to  repress  the  E-cadherin  gene  and  is  recruited  to  the  endogenous  E-cadherin 
promoter  in  a  SNAIL-dependent  manner  (Langer  et  al,  manuscript  submitted). 

Expression  of  Ajuba  during  Xenopus  development  parallels  that  of  Snail,  and  Ajuba 
cooperates  with  Snail  during  Xenopus  neural  crest  development.  Since  AJUBA  itself  does 
not  contain  apparent  enzymatic  activity,  we  postulated  that  AJUBA  may  recruit  other 
effectors  to  the  SNAG  domain  of  SNAIL  to  modify  chromatin  structure. 

In  the  present  study,  we  purified  AJUBA  interacting  proteins  and  describe  the 
protein  arginine  methyltransferase  5  (PRMT5)  as  a  candidate.  PRMT5  is  a  type  II 
protein  arginine  methyltransferase  and  plays  important  role  in  the  regulation  of  gene 
transcription  (26).  The  studies  provide  strong  evidence  that  PRMT5  is  a  key  component 
of  the  SNAIL  silencing  complex  through  binding  to  AJUBA. 
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Materials  and  methods 


Plasmids.  The  Myc  epitope-tagged  Ajuba  plasmids  pME X-myc-Ajuba  have 
been  described  (18).  All  Ajuba  mutants  and  truncations  were  made  using  the 
QuikChange  Site-Directed  Mutagenesis  procedures  following  the  manufacturer’s  protocol 
(Stratagene,  La  Jolla,  CA),  and  all  mutants  were  confirmed  by  DNA  sequencing.  The 
Ajuba  cDNAs  were  subcloned  from  pMEX-Myc-Ajuba  via  digestion  with  Bam  HI  and 
Xho  I  and  insertion  into  the  pcDNA3.1-N-Flag  vector  to  create  a  Flag  epitope-tagged 
AJUBA  fusion  protein  in  the  N-terminus.  The  pcDNA-RFP -Ajuba  and  pGL2-£- 
cadherin  luciferase  reporter  plasmids  have  been  previously  described(37,  40).  The 
pc  D  N  A  -  FI  ag  -  PRMT5  plasmids  was  provided  by  Dr.  Dreyfuss  (15).  The  Sport6-CMV- 
Snail  (murine)  plasmid  was  purchased  from  Open  Biosystems  (Huntsville,  AL). 

The  GST-AJUBA(244-350aa)  and  leucine  to  alanine  substitutions  were 
generated  by  PCR  amplification  of  the  DNA  fragment  encoding  aa  residues  244-350  of 
murine  AJUBA.  The  PCR  products  were  cloned  into  the  Bam  HI  and  Eco  RI  sites  of  the 
pGEX-4T-l.  The  truncated  PRMT5  plasmids  containing  PRMT5(1-170),  PRMT5(169- 
422),  and  PRMT5(421-637)  were  generated  by  amplifying  the  DNA  fragment  encoding 
the  indicated  residues  of  human  PRMT5  and  the  resulting  fragments  were  cloned  into  the 
pET-28a  vector. 

Cell  culture,  transfections  and  luciferease  reporter  assays.  HEK293  cells, 
U20S  cells  and  pl9  cells  were  maintained  in  DMEM  containing  10%  FBS,  2  mM  L- 
glutamine,  and  penicillin  (50  U/ml)/streptomycin  (50  pg/ml)  at  37  C  under  5%  CO2  in  a 
humidified  chamber. 
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For  transfection,  HEK293  cells  were  seeded  at  5  x  104  cells  per  well  in  24- well 
plates.  The  fi-galactosidase  plasmid  (50  ng)  and  pGL2-E-cad-Luc  reporter  (200  ng) 
along  with  Snail  and/or  Ajuba  encoding  plasmids  were  transiently  transfected  into  the 
cells  with  Lipofectamine  2000  reagent  (Invitrogen,  Carlsbad,  CA)  following  the 
manufacturer’s  protocol.  Twenty  four  hours  post  transfection,  cells  were  harvested  and 
lysed.  Luciferase  and  B-galactosidase  activities  were  measured  with  the  luciferase 
reporter  gene  assay  kit  (Promega,  Madison,  WI)  and  the  chemiluminescent  reporter  assay 
for  B-galactosidase  kit  (Clontech,  Mountain  View,  C A),  respectively.  The  transfection 
efficiency  among  plates  was  normalized  to  B-galactosidase  activity  and  all  transfections 
were  repeated  three  times  in  duplicate. 

Affinity  purification  of  a  native  AJUBA  complex.  To  purify  AJUBA- 
associated  proteins,  a  Flag-tagged,  full-length  Ajuba  cDNA  in  the  pcDNA3.1  vector  was 
stably  expressed  in  HEK293  cells.  Single  cell  clones  were  selected  with  G418  and 
screened  by  western  blot  using  anti-Flag  antibody.  A  cell  clone  expressing  the  Flag- 
AJUBA  protein  at  level  comparable  to  that  of  the  endogenous  AJUBA  was  chosen  for  the 
purification.  A  total  of  5xl09  cells  were  lysed  in  buffer  A  containing  20mM  Tris  HC1 
(pH  8.0),  150mM  NaCl,  2.5mM  EDTA,  0.5%  NP40,  0.2  mM  PMSF,  and  0.5  mM  DTT. 
Cell  lysates  were  pre-cleared  with  the  protein  A  agarose  beads  for  2  hours,  and  then 
incubated  with  the  anti-Flag  agarose  M2  beads  (Sigma,  St.  Fouis,  MO)  at  0.5  ml  of 
beads  per  100  mg  of  cell  lysate  for  2  hours  to  overnight  with  rotation.  The  M2  beads 
were  washed  4  times  with  buffer  BC500  containing  20  mM  Tris-HCl  (pH  7.8),  500  mM 
KC1,  0.2  mM  EDTA,  10%  glycerol,  10  mM  (3-mercaptoethanol,  0.2%  NP40,  0.2  mM 
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PMSF  and  1  ug/ml  of  aprotinin,  leupeptin,  and  pepstatin.  The  protein  complex  was 
eluted  with  the  Flag-peptides  (Sigma)  at  lOOpg/ml  in  buffer  BC100  containing  20  mM 
Tris-HCl  (pH  7.8),  50  mM  KC1,  0.2  mM  EDTA,  10%  glycerol,  10  mM  p- 
mercaptoethanol,  0.2  mM  PMSF  and  1  ug/ml  of  aprotinin,  leupeptin,  and  pepstatin.  The 
eluted  proteins  were  resolved  on  4-12%  SDS-PAGE  gels  for  western  blot,  silver  and 
colloidal  staining  analyses.  The  proteins  were  excised  from  the  gel  and  identified  by 
standard  mass  spectrometry  at  the  Wistar  Institute  Cancer  Center,  Proteomics  Core 
Facility. 

Co-immunoprecipiation,  western  blot,  immunofluorescence  and  antibodies. 

My c-Ajuba,  F\ag-PRMT5  and/or  Flag-Snail  plasmids  were  transiently  transfected  into 
HEK293  cells  and  24  hours  post  transfection,  cells  were  lysed  in  buffer  A.  Co- 
immunoprecipiations  were  performed  with  either  anti-Myc  or  anti-Flag  reagents.  The 
western  blots  and  immunofluorescences  were  previously  described  (21,  41).  Mouse 
monoclonal  anti-Myc  (Invitrogen),  anti-Flag  (Sigma),  Rabbit  polyclonal  anti-SNAIF 
(Santa  Cruz  Biotechnology,  Sant  Cruz,  CA),  anti-H4R3  (UPSTATE,  Charlottesville,  VA), 
anti-E-cadherin  (Cell  Signaling,  Danvers,  MA)  antibodies  were  purchased.  The  rabbit 
polyclonal  anti-AJUBA  antibody  was  raised  by  immunizing  rabbits  with  a  bacterial 
expressed  6-His  fusion  protein  of  murine  AJUBA  (amino  acids  1-216)  as  the  antigen. 

SiRNA  knockdown,  the  methyltransferase  inhibitor  5'-Deoxy-5'-methyl- 
thioadenosine  (MTA)  treatment  and  RT-PCR.  Smart  pool  siRNAs  targeting  murine 
Ajuba  and  PRMT5  (Dharmacon,  Lafayette,  CO)  were  transfected  into  the  cells  with  the 
Lipofectamine  2000  reagent  (Invitrogen).  5'-Deoxy-5'-methyl-thioadenosine  (Sigma) 
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was  disolved  in  DMSO.  pl9  cells  were  seeded  at  1.5xl05  cells/well  in  6-well  cell 
culture  plates  on  day  0.  MTA  was  added  into  the  media  on  day  1  at  concentrations  of 
lOOuM  and  200uM  for  48  hours. 

Total  RNA  from  pl9  cells  was  isolated  with  the  RNeasy  Kit  (Qiagen,  Valencia, 
CA).  RNA  was  treated  with  RQ  DNase  I  to  remove  any  genomic  DNA  contamination. 
Two  micrograms  of  the  treated  total  RNA  were  used  for  cDNA  synthesis  in  a  20  pi 
reaction  with  the  Superscript  II  reverse  transcriptase  (Invitrogen).  The  primer  pairs  used 
for  E-cadherin  and  GAPDH  amplification  were:  sense  5'- 

GAGAACGGTGGTCAAAGAGC-3',  anti-sense  5'-CATCTCCCATGGTGCCACAC-3, 
and  sense  5'  ACCACAGTCCATGCCATCAC,  anti- sense: 

TCCACCCCCTGTTGCTGTA  respectively.  PCR  amplification  was  carried  out  using 
Taq  DNA  Polymerase  (Promega)  at  94  C  for  15  sec,  at  60  C  for  15  sec,  and  72  C  for  60 
sec. 


Chromatin  immunoprecipitation.  Chromatin  immunoprecipitation  (ChIP) 
experiments  were  carried  out  in  HEK293  cells  stably  expressing  the  Flag -Snail  cDNA. 
HEK  293 -Flag-Shm/  and  HEK  293-vector  cell  lines  were  established  by  transfection  of 
pcDNA3.1  Flag-Snail  and  parental  vectors  into  HEK293  cells,  and  selected  with  zeocin 
at  400ng/ml.  The  expression  of  the  Flag-SNAIL  protein  was  confirmed  by  western  blot. 
To  prepare  cells  for  ChIP,  H E K 2 9 3 - F 1  a g - Sn a i l  and  HEK293-vector  cells  were  grown  in 
150  mm  plates  to  70-90%  confluency,  and  fixed  by  addition  of  574  pi  of  37% 
formaldehyde  directly  into  20  ml  of  growth  medium  to  a  final  concentration  of  1%  for  20 
minutes  in  the  cell  culture  incubator.  The  crosslinking  reaction  was  stopped  by  addition 


of  1.25  ml  of  2M  glycine  in  PBS  buffer  at  room  temperature  for  5  minutes.  Cells  were 


harvested  and  the  ChIP  assays  were  performed  according  to  the  protocol  supplied  with 
the  EZ-CHIP  kit  (UPSTATE,  Cat.  17-371).  The  immunoprecipitated  DNAs  were 
amplified  with  the  primer  set  (Primer  1)  PI  5’-AATCAGAACCGTGCAGGTCC-3’  and 
P2  5 ’  -ACAGGTGCTTTGCAGTTCCG-3’ .  This  250bp  amplicon  flanks  the  three  E- 
boxes  located  in  the  proximal  promoter  region  of  the  E-cadherin  gene.  A  second  set  of 
primers  (Primer  2),  P3  5’ -GGCTCAAGCTATCCTTGCAC-3’ ,  and  P4  5’- 
GTGCAGTGGCTCAT-GTCTGT-3’  was  used  to  amplify  a  197bp  fragment,  encoded  by 
exon  16  of  the  E-cadherin  gene.  The  PCR  fragments  were  cloned  and  their  identities 
were  confirmed  by  DNA  sequencing.  For  quantitation,  the  PCR  products  were  resolved 
on  2%  agarose  gels  and  visualized  with  ethidium  bromide. 

Results 

PRMT5  was  identified  as  one  of  the  A  JUBA -interacting  proteins 

To  isolate  potential  AJUBA-interacting  proteins  which  may  repress  the  Snail 
target  genes  such  as  E-cadherin ,  we  chose  a  cell  system  which  supports  SNAIL- mediated 
represssion.  Since  HEK293  cells  have  been  widely  used  for  purification  of  protein 
complexes,  we  first  tested  whether  SNAIL  can  repress  E-cadherin  expression  in  this  cell 
lines.  The  E-cadherin-Luc  reporter  contains  three  SNAIL  binding  sites,  which  are 
located  in  the  proximal  promoter  region  of  the  E-cadherin  gene,  a  promoter  which  is 
responsive  to  SNAIL  in  a  variety  of  cell  types  (5,  34,  39).  HEK293  cells  were 
transiently  transfected  with  Snail  and  Ajuba  encoding  plasmids  alone  or  in  combination 
along  with  the  E-cadherin-luc  reporter.  Snail  by  itself  was  able  to  repress  the 
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transcription  of  the  E-cadherin  promoter- driven  luciferase  reporter  in  a  dosage  dependent 
manner  (Figure  IB,  top  panel).  When  Ajuba  and  Snail  were  co-expressed  in  HEK293 
cells,  we  observed  increased  repression  over  Snail  alone  (Figure  IB,  low  panel).  These 
results  demonstrate  that  HEK293  cells  contain  the  factors  required  for  SNAIL  and 
AJETBA  to  repress  E-cadherin.  To  purify  these  factors,  we  selected  a  cell  clone  stably 
expressing  the  Flag-AJUBA  protein  comparable  to  that  of  the  endogenous  level  and 
performed  affinity  purification  (Figure  1C).  As  expected,  the  major  protein  eluted  in  the 
affinity  purification  migrated  at  MW  80kDa  was  the  Flag-AJUBA  protein.  Other 
proteins  migrated  at  MW  95kDa,  70kDa,  and  50  kDa  were  observed  in  multiple 
independent  purification  experiments.  The  major  protein  at  MW  95kDa  was  identified 
as  PRMT5  and  was  pursued  as  a  candidate  AJUBA-interacting  protein.  The  MEP50 
(MW  50kDa)  protein,  a  known  co-factor  for  PRMT5,  was  also  observed  in  the 
purification  (16,  17). 

To  confirm  the  interaction  between  AJUBA  and  PRMT5,  we  first  transiently 
co-transfected  full-length  My c-Ajuba  and  Flag -PRMT5  cDNAs  into  HEK293  cells. 
Reciprical  co-immunoprecipitations  showed  that  PRMT5  and  AJUBA  can  interact  with 
each  other  in  HEK293  cells  (Figure  2A).  To  confirm  that  an  AJUBA-PRMT5 
interaction  occurs  with  the  endogenous  proteins,  whole  cell  lysates  from  pl9  cells  were 
used  for  immunoprecipitation.  Indeed,  the  endogenous  AJUBA  was  able  to 
immunoprecipitate  the  endogenous  PRMT5  (Figure  2B).  Collectively,  these  data 
suggest  that  PRMT5  is  a  novel  AJUBA-interacting  protein. 
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The  leucine  rich  motif  in  the  preLIM  region  of  AJUBA  is  essential  for 
PRMT5  binding 

The  aa  sequence  of  the  AJUBA  protein  predicts  two  structurally  distinct  regions: 
the  preLIM  region  in  the  N-terminus  and  the  LIM  domains  in  the  C-terminus  (Figure  3A). 
To  determine  which  regions  in  AJUBA  are  responsible  for  the  PRMT5  binding,  plasmids 
encoding  Myc-tagged,  full-length  AJUBA  and  the  two  truncated  forms,  preLIM  and  LIM, 
were  co-transfected  with  the  Flag-PRMT5  plasmid  into  HEK293  cells.  The 
immunoprecipitation  experiments  showed  that  the  preLIM  region  bound  PRMT5,  while 
the  LIM  domain  did  not  (Figure  3B,  Lanes  2,  4,  6).  To  further  identify  which  specific 
motifs  or  residues  in  the  preLIM  region  are  critical  for  the  interaction  between  AJUBA 
and  PRMT5,  we  made  progressive  C-terminal  deletions  in  the  preLIM  region  (Figure  3A). 
These  experiments  confirmed  that  the  preLIM  bound  to  PRMT5.  Interestingly,  the 
truncated  preLIM-312  protein  showed  stronger  binding  than  that  of  the  complete  preLIM 
region,  suggesting  that  the  region  encompaning  aa  312-347  may  contain  inhibitory  motifs 
to  prohibit  PRMT5  binding  (Figure  3C,  Lanes  2-3).  The  proteins  preLIM- 279,  255  and 
210  failed  to  bind  PRMT5.  These  data  indicate  that  the  region  between  residues  279  and 
312  of  AJUBA  is  essential  for  PRMT5  binding.  Analysis  of  this  33  aa  region  (279-312) 
revealed  a  13  residue  leucine -rich  region  (DELTALLRLTVAT)  with  potential  a  helical 
character  (Figure  4A).  This  region  is  also  a  potential  nuclear  export  signal  (18).  To 
determine  the  role  of  these  leucines  in  PRMT5  binding,  we  mutated  them  individually  or 
in  combination  to  alanine.  Mutation  of  individual  leucine  residues  did  not  significantly 
affect  the  interaction,  while  simultaneous  mutation  of  leucines  292,  293  and  295  (AJUBA 
L3A)  greatly  attenuated  PRMT5  binding  (Figure  4B). 
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To  identify  the  regions  in  the  PRMT5  protein  that  can  interact  with  AJUBA,  we 
used  the  full-length  PRMT5  and  three  truncations  (Figure  4C).  These  fragments  were 
translated  in  vitro  and  used  for  in  vitro  GST-AJUBA  (244-350aa)  binding  assays.  The 
full-length  PRMT5  and  the  three  truncations  bound  to  GST-AJUBA  (wild-type),  while  no 
binding  was  observed  for  GST-AJUBA  L3A  mutant  (data  not  shown).  These  data 
suggest  that  multiple  domains  in  the  PRMT5  protein  can  interact  with  AJUBA. 

Co-localization  of  AJUBA  and  PRMT5  in  U20S  cells 

To  detect  the  subcellular  localization  of  AJUBA  and  PRMT5,  we  performed 
immunofluorescent  assays  in  U20S  cells.  Plasmids  encoding  Myc- AJUBA  and  Flag- 
PRMT5  were  transiently  transfected  into  U20S  cells.  When  expressed  alone,  both  the 
AJUBA  and  PRMT5  proteins  were  predominantly  cytoplasmic  with  similar  distribution 
patterns  (Figure  5  A,  C).  When  co-expressed,  the  distribution  of  each  protein  was  not 
significantly  changed  (Figure  5  D).  Similar  to  the  wild  type  AJUBA,  the  AJUBA  L3A 
mutant  was  also  localized  in  the  cytoplasm  and  mutation  of  the  three  leucines  to  alanines 
did  not  apparently  affect  its  localization  (Figure  5  B,E). 

SNAIL,  AJUBA  and  PRMT5  are  found  in  the  same  complex 

To  determine  whether  the  AJUBA  protein  functions  as  an  adaptor  to  bridge 
SNAIL  and  PRMT5  proteins,  we  looked  for  ternary  complex  formation  using  co- 
immunoprecipitation.  We  transfected  the  My c-Ajuba  encoding  plasmid,  together  with 
Flag -Snail  and  Flag -PRMT5  encoding  plasmids  into  HEK293  cells.  We  observed  that 
AJUBA  could  simultaneously  associate  with  PRMT5  and  SNAIL  (Figure  6A,  lane  5). 


12 


This  was  surprising  given  that  the  AJUBA  and  PRMT5  proteins  are  predominately 


cytoplasmic,  whereas  SNAIL  is  mostly  nuclear.  We  speculated  that  PRMT5  can  be 
translocated  into  the  nucleus  by  forming  a  complex  with  SNAIL  and  AJUBA.  To 
visualize  such  an  interaction,  we  transfected  Snail,  F\ag-PRMT5,  KFP-Ajuba  and/or 
pcDNA-RFP  encoding  plasmids  into  U20S  cells.  When  expressed  alone,  the  SNAIL 
protein  was  localized  in  the  nucleus,  while  the  AJUBA  and  PRMT5  proteins  were  in  the 
cytoplasm.  The  RFP  alone  was  found  in  both  the  cytoplasm  and  the  nucleus  and  the  RFP- 
AJUBA  fusion  protein  showed  identical  localization  compared  to  the  wild  type  AJUBA 
(Figure  6B).  However,  co-expression  of  AJUBA  and  SNAIL  affected  both  of  their 
localization:  the  SNAIL  protein  was  retained  in  the  cytoplasm,  and  concomitantly  a 
significant  amount  of  the  AJUBA  protein  was  localized  in  the  nucleus  (Figure  6D).  In 
contrast,  co-expression  of  PRMT5  and  SNAIL  revealed  no  apparent  effect  on  either 
localization  (Figure  6E).  Strikingly,  when  AJUBA,  SNAIL,  and  PRMT5  were  co¬ 
expressed,  a  significant  amount  of  the  PRMT5  protein  was  relocated  to  the  nucleus  and 
was  co-localized  with  nuclear  SNAIL  and  AJUBA  (Figure  6F).  Taken  together,  the 
results  suggest  that  SNAIL,  AJUBA  and  PRMT5  can  form  a  complex  in  the  nucleus. 

Modulation  of  AJUBA  and  PRMT5  in  pl9  cells  results  in  upregulation  of 
E-cadherin  expression. 

Since  we  showed  evidence  of  nuclear  localized  SNAIL- AJUBA-PRMT5 
complex,  we  sought  to  determine  whether  the  well-established  SNAIL  target  gene  E- 
cadherin  was  targeted  by  this  ternary  complex.  It  has  been  previously  found  that 
induction  of  Snail  in  pl9  cells  repressed  E-cadherin  gene  expression,  while  depletion  of 
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Snail  resulted  in  upregulation  of  the  Snail- target  genes  (9,  30).  An  siRNA  pool  targeting 
murine  Ajuba  was  transfected  into  the  pl9  cells.  The  level  of  the  AJUBA  protein  was 
significantly  decreased  by  the  siRNA  (Figure  7A).  Moreover,  the  expression  of  E- 
cadherin  at  both  the  mRNA  and  protein  levels  was  significantly  increased  (Figure  7 A,  B). 
These  results  suggest  that  AJUBA  is  involved  in  the  repression  of  E-cadherin  gene 
expression. 

To  determine  the  role  of  PRMT5  in  the  regulation  of  the  E-cadherin  gene 
expression,  we  treated  pl9  cells  with  the  methyltransferase  inhibitor  MTA,  which  has 
been  shown  to  block  PRMT5  function  (38).  Treatment  of  the  pl9  cells  with  MTA  at  the 
doses  of  lOOuM  and  200uM  stimulated  E-cadherin  expression  compared  to  the  DMSO 
control  (Figure  7C).  As  a  more  specific  approach,  we  employed  siRNA  of  knockdown 
PRMT5.  The  siRNA  pool  targeting  PRMT5  was  transfected  into  pl9  cells  and  the 
protein  levels  of  PRMT5  was  significantly  decreased  (Figure  7D).  Moreover,  E- 
cadherin  expression  was  increased  in  PRMT5  knockdown  cells  as  shown  by  western  blot 
analyses.  These  data  strongly  suggest  that  both  AJUBA  and  PRMT5  mediate  SNAIL- 
dependent  repression  of  E-cadherin  and  that  the  methyltransferase  activity  of  PRMT5  is 
required. 


SNAIL,  AJUBA  and  PRMT5  bind  to  the  endogenous  promoter  of  E- 

cadherin 

To  examine  whether  SNAIL,  AJUBA  and  PRMT5  form  a  functional  multi¬ 
protein  complex  to  repress  the  SNAIL  target  gene  in  living  cells,  we  employed  ChIP 
assays  to  examine  their  association  with  the  endogenous  promoter  of  the  E-cadherin  gene. 
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Clonal  H  E  K  293  -  FI  ag-Snail  cells  were  established  by  stable  transfection  and  protein 


expression  was  confirmed  by  western  blot  (Figure  8B).  Expression  of  exogenous  Flag- 
SNAIL  induced  morphological  changes  in  HEK293  cells  as  has  been  observed  on  other 
cell  types  (data  not  shown)  (1,  7,  22,  43).  Downregulation  of  the  E-cadherin  gene 
expression  was  observed  in  Flag-Snail  transfected  HEK293  cells  (Figure  8B).  The 
immunoprecipitated  DNA  fragments  were  examined  by  PCR  amplification  using  a 
primer  set  (Primerl)  which  flanks  the  three  SNAIF-binding  sites  located  in  the  proximal 
promoter  region  of  the  E-cadherin  gene.  A  second  primer  set  (Primer  2)  amplified  a 
fragment  which  is  present  in  exon  16  of  the  E-cadherin  gene  (Figure  8A).  We  observed 
that  the  proximal  promoter  of  the  E-cadherin  gene  flanking  the  three  SNAIF  binding 
sites  was  highly  enriched  by  antibodies  to  SNAIF,  AJUBA,  and  PRMT5  in  HEK293- 
Flag -Snail  cells.  However,  in  the  HEK293-vector  cells  this  enrichment  was  not 
observed  (Figure  8C).  Using  an  antibody  which  detects  methylated  histone  H4  at 
arginine  3  (H4R3)  we  detected  increased  H4  methylation  at  the  promoter  of  the  E- 
cadherin  gene  (Figure  8C).  The  fragment  resided  in  exon  16  was  not  enriched  by  any  of 
these  antibodies.  Together,  these  data  suggest  that  the  association  of  SNAIF,  AJUBA 
and  PRMT5  with  the  E-cadherin  gene  occurs  at  the  proximal  promoter  of  the  E-cadherin 
gene. 


Discussion 

Herein,  we  describe  the  identification  of  PRMT5  as  a  repressor  recruited  to  the 
SNAIF  complex  via  interacting  with  the  AJUBA  co-repressor.  We  demonstrated  that 
PRMT5  can  form  multi-protein  complexes  containing  SNAIF  and  AJUBA  which 
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function  to  repress  the  canonical  SNAIL  target  gene  E-cadherin.  To  support  this,  we 
showed  that  treatment  of  the  pi 9  cells  with  the  methyltransferase  inhibitor  MTA  or  with 
siRNA  targeting  PRMT5  stimulate  E-cadherin  expression.  Further,  PRMT5  was  shown 
to  specifically  bind  to  the  proximal  promoter  of  the  E-cadherin  gene  and  concomitantly, 
the  methylation  status  of  histones  at  this  locus  represented  by  H4R3  is  increased  in  the 
presence  of  the  SNAIL.  Together,  these  studies  suggest  that  PRMT5  is  a  key  mediator 
for  the  regulation  of  the  E-cadherin  gene  expression  and  the  methyltransferase  acivity  of 
PRMT5  is  involved  in  the  transcriptional  repression  of  the  SNAIL  complex. 

The  PRMT5  protein  is  a  member  of  type  II  protein  arginine  methyltransferases 
and  can  methylate  transcription  factors  and  histones  on  specific  arginine  residues  to 
regulate  gene  expression  (2,  12,  32,  33).  For  examples,  PRMT5  was  found  to  interact 
with  BRG1  and  BRM,  components  of  human  SWI/SNF  chromatin  remodeling  complex 
to  methylate  histones  H2A,  and  H4  on  arginine  3,  and  H3  on  arginine  8.  These  activities 
of  PRMT5  result  in  the  repression  of  genes  such  as  ST7  and  NM23  and  promotion  of  a 
tumorigenic  state  in  NIH3T3  cell  (32,  33).  PRMT5  can  interact  with  Blimp  1,  a  zinc 
finger  transcriptional  repressor,  and  supresses  Dhx38  gene  expression  by  methylation  of 
the  histones  H2A  and  H4  on  arginine  3  (2).  The  PRMT5  protein  has  also  been  found  to 
be  part  of  E2F  complex  in  the  cyclin  El  promoter,  correlating  with  repression  of 
transcription  of  cyclin  El  gene  (12). 

As  described  above,  this  is  good  evidence  that  PRMT5  is  involved  in  the 
transcriptional  repression.  Paradoxically,  the  majority  of  the  PRMT5  protein  at  steady 
state  is  found  in  the  cytoplasm.  How  PRMT5  is  translocated,  retained  and  targeted  to 
specific  genes  in  the  nucleus  is  not  clear.  We  demonstrate  that  PRMT5  can  be 
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translocated  into  the  nucleus  via  forming  a  complex  with  AJUBA  and  SNAIL.  This 
SNAIL  may  function  as  a  nuclear  anchor  to  target  PRMT5  to  its  target  genes. 

The  MEP50  protein  is  also  found  in  the  AJUBA-PRMT5  complex.  The  MEP50 
protein  contains  WD  motifs  and  is  constitutively  associated  with  the  PRMT5  protein  (16, 
17).  Several  WD  motif-containing  proteins  were  recently  shown  to  be  essential  for 
global  histone  methylation  and  the  regulation  of  gene  transcription  (42).  The  MEP50 
protein  was  shown  to  direct  PRMT5  to  specific  histones  and  is  indispensible  for  PRMT5- 
dependent  histone  modification  (17).  Further  work  will  be  necessary  to  establish  the  role 
of  MEP50  in  SNAIL-mediated  gene  repression  and  PRMT5  function. 

The  SNAIL  family  of  protein  play  key  roles  in  regulation  of  EMT  events  during 
the  development  and  metastasis,  and  also  serve  as  an  early  markers  for  the  malignant 
phenotype  and  prognosis  (3,  4,  6,  10,  27,  29).  Recently,  Snail  was  shown  to  be 
spontaneously  up-regulated  during  the  process  of  tumor  recurrence  in  mice  and  high  level 
of  the  Snail  expression  strongly  predicts  decreased  relapse- free  survival  in  women  with 
breast  cancer  (29).  These  observations  strongly  implicate  a  critical  role  of  Snail  in  the 
process  of  breast  cancer  recurrence.  Therefore,  identification  of  proteins  involved  in 
SNAIL-dependent  repression  will  not  only  shed  new  light  on  understanding  the 
mechanims  of  SNAIL  in  EMT  and  tumor  recurrence,  but  also  provide  new  targets  for 
potential  drug  development  and  diagnostics. 

Previous  studies  have  demonstrated  the  association  of  SNAIL  with  potential  co¬ 
regulators  through  its  SNAG  domain.  These  include  histone  deacetylases  1  and  2 
(HDAC1  and  HDAC2),  and  the  co-repressor  mSIN3A  (34).  HDACs  are  commonly 
found  in  large  protein  complexes  in  vivo  both  in  the  cytoplasm  and  in  the  nucleus  and 
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may  direct  regulation  of  gene  expression,  the  cell  cycle,  differentiation,  and  DNA  repair. 
HDAC1  and  HDAC2  have  been  shown  to  associate  with  SMRT,  the  CoREST  complex, 
mSIN3,  N-CoR,  Mi-2/NuRD  and  play  essential  roles  in  gene  silencing  (11,  36). 

However,  how  SNAIL  mediates  the  complex  assembly  remains  elusive.  Here,  we  showed 
that  AJUBA  recruits  PRMT5  via  its  preLIM  region  to  the  SNAG  domain  of  SNAIL. 

Since  both  HDAC1  and  HDAC2,  and  AJUBA-PRMT5  interact  with  the  SNAG  domain 
of  SNAIL,  it  will  be  interesting  to  examine  the  compementary  roles  of  these  enzymes  in 
SNAIL-mediated  repression. 

It  is  a  reasonable  assumption  that,  in  addition  to  PRMT5,  other  factors  are 
recruited  to  the  SNAIL  complex  via  interacting  with  AJUBA.  Like  many  other  LIM 
proteins,  AJUBA  has  been  shown  to  function  as  a  scaffold  protein  and  can  interact  with  a 
variety  of  proteins  including  transcription  regulators,  kinases,  and  cytoskeleton  proteins 
(13,  18,  20,  23-25,  28,  37).  How  these  interactions  are  spatially  and  temporally  regulated 
should  shed  new  light  on  the  roles  of  these  important  and  clinically  relevent  transcription 
factors. 
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Figure  1.  Affinity  purification  of  AJUBA  interacting  proteins.  (A)  the  diagram 
showing  the  architectures  of  the  SNAIL  and  AJUBA  proteins.  (B)  AJUBA  can  augment 
SNAIL-mediated  repression  on  the  promoter  luciferase  reporter  activities  of  E-caclherin 
in  HEK293  cells.  (C)  colloidal  staining  shows  the  potential  AJUBA-interacting  proteins 
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purified  from  HEK293  cells.  *  represents  Actins.  The  pre-stained  molecular  weight 
marker  (Benchmarker,  Invitrogen)  migrates  15-20kDa  faster  than  predicted  MW. 

Figure  2.  PRMT5  is  a  novel  AJUBA-interacting  protein.  (A)  the  exogenously  expressed 
PRMT5  and  AJUBA  proteins  interact  in  HEK293  cells.  (B)  the  endogenous  AJUBA  and 
PRMT5  proteins  interact  in  pl9  cells.  Immunoprecipitation  was  performed  with  anti- 
AJUBA  polyclonal  antibody  and  pre-immune  rabbit  IgG  was  used  as  control.  The 
immunoprecipitated  proteins  were  resolved  on  SDS-PAGE  gel  and  western  blot  was 
performed  with  monoclonal  anti-PRMT5  antibody. 

Figure  3.  PRMT5  binds  to  a  single  domain  in  the  preLIM  region.  (A)  the  diagram 
shows  the  progressive  deletions  of  AJUBA.  (B)  the  preLIM  region  of  AJUBA  binds  to 
PRMT5.  (C)  the  region  between  residues  279  and  312  of  AJUBA  bind  to  PRMT5. 
Plasmids  were  transfected  into  HEK293  cells.  Immunoprecipitations  were  carried  out 
with  anti-Myc  antibody,  and  western  blots  were  performed  with  anti-Flag  antibody. 

Figure  4.  The  leucine  residues  in  the  region  between  residues  279  and  312  of  AJUBA  are 
essential  for  PRMT5  binding.  (A)  the  diagram  shows  the  substitutions  for  leucine 
residues  in  AJUBA.  (B)  simultaneous  mutation  of  the  three  leucines  to  alanine  abolishes 
the  interaction  between  AJUBA  and  PRMT5.  Co-immunoprecipitation  was  carried  out  in 
HEK293  cells  as  in  Figure  2.  (C)  multiple  regions  in  the  PRMT5  protein  can  bind 
AJUBA.  Bacterially  expressed  and  purified  GST-AJUBA  (244-350aa)  and  the  indicated 
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mutant  proteins  were  incubated  with  in  vitro  translated  PRMT5  proteins,  and  the  binding 
data  are  summarized  in  the  figure. 

Figure  5.  AJUBA  and  PRMT5  are  co-localized  in  U20S  cells.  The  plasmids  encoding 
Myc-AJUBA  and  Flag-PRMT5  were  transiently  transfected  into  U20S  cells  and 
immunofluorescent  images  were  taken  with  confocal  microscopy.  (A)  AJUBA  is 
predominately  cytoplasmic  when  expressed  alone.  (B)  the  AJUBA  L3A  mutant  is  also 
localized  in  the  cytoplasm  and  mutation  of  the  three  leucines  to  alanine  does  not 
apparently  affect  its  localization.  (C)  PRMT5  proteins  is  cytoplasmic  with  similar 
distribution  patterns  to  AJUBA.  (D)  co-expression  of  AJUBA  and  PRMT5.  (E)  co¬ 
expression  of  AJUBA  L3A  and  PRMT5. 

Figure  6.  The  SNAIL,  AJUBA  and  PRMT5  proteins  are  found  in  the  same  complex.  (A) 
the  AJUBA  protein  can  simultaneously  immunoprecipiate  SNAIL  and  PRMT5.  Plasmids 
encoding  My c-Ajuba,  Flag -Snail  and  Flag -PRMT5  were  transiently  transfected  into 
HEK293  cells,  and  co-immunoprecipiation  was  performed  with  anti-Myc  antibody  and 
blotted  with  anti-Flag  antibody.  (B)  when  expressed  alone,  the  SNAIL  protein  is 
localized  in  the  nucleus,  while  the  RFP- AJUBA  and  PRMT5  proteins  are  in  the 
cytoplasm,  the  RFP  alone  is  found  in  both  cytoplasm  and  nucleus.  (C)  co-expression  of 
RFP  and  SNAIL  does  not  affect  their  localization.  (D)  co-expression  of  AJUBA  and 
SNAIL  affects  both  of  their  localization:  the  SNAIL  protein  is  retained  in  the  cytoplasm, 
and  concomitantly  a  significant  amount  of  the  AJUBA  protein  is  localized  in  the  nucleus. 
(E)  co-expression  of  PRMT5,  RFP  and  SNAIL  reveals  no  apparent  effect  on  either 
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localization.  (F)  co-expression  of  RFP-AJUBA,  SNAIL,  and  PRMT5  results  in  a 
significant  amount  of  the  PRMT5  protein  being  relocated  to  the  nucleus  and  co-localized 
with  nuclear  SNAIL  and  AJUBA. 

Figure  7.  Modulation  of  AJUBA  and  PRMT5  in  pl9  cells  results  in  upregulation  of  E- 
cadherin  expression.  (A)  western  blot  shows  that  siRNA  targeting  AJUBA  can  stimulate 
E-cadherin  gene  expression  in  pl9  cells.  (B)  RT-PCR  analysis  of  the  E-cadherin  mRNA 
level  in  si -Ajuba  pl9  cells.  (C)  the  methyltransferase  inhibitor  5'-Deoxy-5'- 
methylthioadenosine  (MTA)  stimulates  E-cculherin  expression  in  pl9  cells.  (D)  similarly, 
siRNA  knockdown  PRMT5  in  pl9  cells  results  in  upregulation  of  E-cadherin  gene 
expression  shown  by  western  blot. 

Figure  8.  SNAIL,  AJUBA  and  PRMT5  are  associated  with  the  E-cadherin  gene  at  the 
proximal  promoter.  (A)  the  diagram  ilustrating  the  E-cadherin  promoter  and  PCR 
primers  used  for  ChIP.  A,  B  and  C  in  the  boxes  indicates  the  three  SNAIL-binding  sites 
in  the  promoter.  (B)  western  blot  analysis  of  the  Flag-SNAIL,  AJUBA,  PRMT5  E- 
cadherin  protein  expression  in  HEK293  cells  indicates  that  the  E-cadherin  gene  is  down- 
regulated  by  overexpression  of  Flag-SNAIL.  (C)  PCR  analysis  of  the 
immunoprecipitated  DNA  fragments. 

Figure  9.  The  model  for  AJUBA  and  PRMT5  in  SNAIL-mediated  gene  repression. 
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